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Preface 


These lecture notes cover the material of a four hour course on Lie transformation 
groups. The goal of this course is to provide the foundations of symmetry group 
analysis of differential equations, a vast and very active research area with con¬ 
nections to Lie group theory, differential geometry, differential equations, calculus 
of variations, integrable systems, and mathematical physics, among others. The 
standard reference in this field is P. J. Giver’s seminal work and to a large part 
this set of notes will closely follow his exposition. We do, however, take a slightly 
different approach to the foundations of the field and try to give a reasonably com¬ 
plete exposition of the underlying theory of local transformation groups, which in 
turn is based on H. Sussmann’s study [15] of the integrability of distributions of 
non-constant rank. For connecting this material with Giver’s approach, we have 
found V. Schmidt’s doctoral thesis m very helpful. 

I have tried to make these notes self-contained, pre-supposing only that the reader 
be familiar with the material covered in basic courses on Differential Geometry 
and Lie Groups. More precisely, the presentation is a direct continuation of my 
courses [9] and [TT] (which in turn is mostly based on [3]), as well as the chapter 
on submanifolds in m- In particular, these notes supply proofs for many results 
that are left to the reader in M- Clearly what we can provide here is only a first 
step into the field, but it should allow the reader to delve into the subject, e.g., by 
further studying HU. 

I am greatly indebted to Roman Popovych for carefully reading the entire manuscript 
and for suggesting numerous improvements. 


Michael Kunzinger, summer term 2015 
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Chapter 1 


Lie transformation groups 


1.1 Basic concepts 

Throughout this course, by a manifold we mean a smooth (C°°) manifold. However, 
we do not a priori assume further properties of the natural manifold topology like 
Hausdorff or second countable. In this chapter we will follow [3] . 

We begin by recalling El Def. 16.1]: 

1.1.1 Definition. A transformation of a manifold M is a diffeomorphism from 
M onto M. A group G acts on M as a transformation group (on the left) if there 
exists a map G x M ^ M satisfying: 

(i) For every g € G the map d>g: m i—>■ ^{g,m) is a transformation of M. 

(a) For all g, h G G, o = ^gh- 

In particular, d>e = idM- 

G acts effectively on M if $g(m) = m for all m G M implies g = e. G acts freely 
on M if it has no fixed points, i.e., if $g(m) = m for some m G M implies that 
g = e. 

Our main object of study is introduced next. 

1.1.2 Definition. A Lie group G acts as a Lie transformation group on a manifold 
M if there exists a smooth surjection G x M ^ M such that for all g, h G G, 

= <I>gh. 

We first note that any Lie transformation group is a transformation group in the 
sense of ll.l.ll To see this, note first that $ being surjective implies that <l?e = idM- 
In fact, given m G M there exist m' G M, h G G with = m. Therefore, 

$e(m) = $6 o ^hijn') = ^hijn') = m. Consequently, = $g-i for all g G G, 
so each $g is a transformation of M, and we have verified all conditions from Def. 
11.1.11 We will often briefly write gm instead of $g(m). 

1.1.3 Example. Let X be a complete vector field on a Hausdorff manifold M. 
Then the flow of X defines a smooth map Fl^ : Kx M —M with 4'(t, m)) 
= 4'(s + t,m) for all s, t G M. and all m G M. T therefore induces a Lie trans¬ 
formation group of the Lie group (K.,-I-) on M. Each := m '^{t,m) is a 
transformation on M and 1 1 —>■ 'I't is a homomorphism of M into the group of trans¬ 
formations on M. 
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1.1.4 Lemma. Let G be a Lie group acting as a Lie transformation group on M 
and let H be a Lie subgroup of G. Then also H aets as a Lie transformation group 
on M. 


Proof. Let j : H ^ G he the natural injection and let $: G x M —>■ M be as in 
11.1.21 Then '■ H x M ^ M, := $ o (j x idM) does the trick. □ 

If G acts as a transformation group on M via 4) then a subset A oi M is called 
invariant under G if 4)(G x A) C A. 

1.1.5 Proposition. Lf a regular submanifold M' of M is invariant under the 
action of a Lie transformation group G on M then G acts naturally on M' as a Lie 
transformation group. 

Proof. Let j : Gx M' ^ GxM he the natural injection. Then since M' is regular, 
the map 4>' : G x M' —?> M' induced by $ o j is smooth (see [TOl 1.1.14]), hence it 
defines an action of G on M'. □ 


1.1.6 Example. GL(n, R) acts on K" as a Lie transformation group via {A,x) i—>■ 
Ax. Bv ll.1.41 so does 0(n,IR.). Since is a regular submanifold of R." invariant 
under 0(n,R), 11.1.51 implies that 0(n,R) acts on 5"“^ as a Lie transformation 
group. 

For the notion of quotient manifold, cf. El Sec. 15]. We say that a transformation 
group G preserves an equivalence relation p on M if (mi, m 2 ) € p and g € G imply 
{gmi,gm2) G p. 

1.1.7 Proposition. If M/p is a quotient manifold of M and if the equivalence 
relation p is preserved by a Lie transformation group G on M then G acts naturally 
on Mjp as a Lie transformation group. 


Proof. Denote hy tt : M ^ M/p the natural surjection. Then id x tt : G x 
M ^ G X (M/p) is a submersion, so G x {M/p) is a quotient manifold of G x M. 
Moreover, the smooth map 7 ro<i>; GxM —>■ M/p is an invariant of the corresponding 
equivalence relation on G x M since {g, mi) ~ {g, m 2 ) if and only if {gmi, gm 2 ) G p, 
which implies tt o $( 5 , mi) = 7 r(pmi) = 7 r(pm 2 ) = tt o $( 5 , m 2 ). By [TT| 15.13] it 
therefore projects to a smooth map 

4>p : G X (M/p) -)> M/p, {g, 7 r(m)) 7 r(pm) 
which defines the required action of G on M/p since 


^p( 5 i,^’p( 52 , 7 r(m))) = 4>p(pi, tt o $( 52 , m)) = tt o 4>(pi, $(p 2 , m)) 
= TT o $(pip 2 , m) = 4 >p(piP 2 , 7 r(m)). 


□ 


1.1.8 Example. Any Lie group G acts on itself as a Lie transformation group 
via multiplication p: G x G —)> G. A subgroup H oi G determines an equivalence 
relation p on G by 

( 51 , 52 ) G p :<t^ 3/1 G M : 51 = P 2/1 

and this relation is preserved by the transformation group. If the set of orbits G/H 
(i.e., the set of left cosets) is a quotient manifold of G (cf. [TT| Sec. 15, 20]) then 
G acts naturally on G/H as a Lie transformation group with action $ given by 
{g,aH) I—>■ {ga)H. A sufficient condition for G/H to be a quotient manifold of G is 
that H is closed, not open, and connected (see (TT] 20.5]). 
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1.1.9 Proposition. If H is a closed subgroup of a Lie group G such that G/H is 
a quotient manifold ofG, then G/H is a Hausdorff manifold. 

Proof. Recall from [111 15.9] that the natural manifold topology of G/H is the 
quotient topology with respect to the natural projection tt : G —>■ G/H. Suppose 
first that gH S G/H is distinct from H. Then gH is a closed subset of G that 
does not contain e. Hence there exists a neighborhood G of e in G which is disjoint 
from gH. Pick a neighborhood V of e in G such that V~^V C U. We show that 
•7T{Vg) and tt{V) are disjoint neighborhoods of gH and H, respectively. Indeed, if 
tt{V g) n 7r(y) 7 ^ 0 then there exist a, b G V such that agH = bH, so a~^b G gH. 
But a~^b G V~^V C U, so we obtain a contradiction. 

In general, if gH and g'H are distinct elements of G/H then so are H and {g~^g')H 
and by what we have shown above they possess disjoint neighborhoods W and W'. 
Denoting by (/> the action of G on G/H from 11.1.81 it follows that $g(IT) and 
^g{W') are disjoint neighborhoods of gH and g'H. □ 

Recall from m Sec. 4, 6] that a vector field X G X(G) is called left- (resp. right)- 
invariant if L*X = X (resp. H*X = X) for each g G G. We denote by Xl(G) (resp. 
Xr{G)) the space of left- (resp. right)-invariant vector fields on G. Also, we identify 
Xl{G) with the Lie algebra g via TgG 9 u >->■ L’' G Xl(G), where L^{g) := Tf,Lg{v). 
The inverse of this map is A i—>■ X(e). 

Similarly, g is isomorphic to Xfi(G) via TgG ^ v ^ R" G Xii{G) with R^'ig) := 
TeRg{v). We usually equip g with the Lie algebra structure induced by [u,r(;] := 
[L’',L™](e). Alternatively, we may set [v,w]}i := [i?", i?“](e). The resulting Lie 
algebra will be denoted by g/j. We have: 

1.1.10 Proposition. Let G be a Lie group. Then 

(i) The map F : L'" —R^ is a Lie algebra isomorphism from Xl(G) onto 

XniG). 

(ii) The map f:v^ —v is a Lie algebra isomorphism from g onto qr. 

Proof, (i) By what was said above, A is a linear isomorphism, since both L*' 
u G g and v i—>■ R^ are. Furthermore, by m 6.1 (iii)] we have: 

[F(L’'),F(L™)] = [R",R^] = = -RbH = F(L[^'’"1) = A([L^L’"]), 

so A is a Lie algebra isomorphism. 

(ii) We have [u,w]fl: = [i?’',i?’"](e) = = —[u,w]. Hence 

[f(.v)J{w)]R = [-v,-w]r = [v,w]r = -[u,u;] = f{[v,w]). 


□ 


1.1.11 Remark. By m 6.1] The map F from 11.1.101 can also be written as 
X I—>■ v*X. In fact, if A G Xl(G) then since v o Rg = Lg-i o i/ we have 

R*g{v*X) = {vo Rg)*X = (Lg-i o v)*X = A) = v*X, 

so V*X G Xr(G), and by symmetry i^* : Xl{G) —>■ Xr{G) is an isomorphism. 
Moreover, v*{L'"){e) = {Tev)~^ o L'" o v{e) = —u, so iy*{L'") = R~'" = F{L'"). 
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Let now ^rGxM—J-Mbea Lie transformation group and let m £ M. Then the 
map 


'■ G ^ M 
^m{g) := ‘^{g,m) 

is smooth. Given any v £ TgG = g, let 

$(?;) : M TM 
^{v){m) := Te^m{v) 


( 1 . 1 . 1 ) 


( 1 . 1 . 2 ) 


Then $(u)(to) = T$(?;,0m), so $(u) is smooth. Also, with tt : TM —>■ M and 
TT : T{G X M) G X M the canonical projections, tt o ^(v){m) = <i)(7r(u, 0^)) = 
<i>(e,m) = TO, so $(u) is a smooth section of TM, i.e., $(?;) £ X(M). We set 


3?(G,M):={$(u) |u£g}. (1.1.3) 

1R(G,M) is called the Killing algebra of <&. Since v H> $(u) is linear, it is a finite¬ 
dimensional vector space. In fact, we even have: 


1.1.12 Proposition. The map $: u i—>■ $(u) is a Lie algebra homomorphism from 
qr onto 3i(G, M). 


Proof. Given any n £ g, we first show that for any to £ M, the right-invariant 
vector field T on G with T(e) = v is $m-related to <i>(n). In fact, noting that 
^moRgih) = ^m{hg) = $(/i, $^( 5 )) = (^) with to' := ^m{g), we obtain: 

Tg^miYig)) = Tg<^>raiT,Rg{v)) = Te{^m o Rg){v) = T,<^m'{v) = $(n)($^(5)). 

Thus for any u £ g, 

Tg$^(r( 5 )) = <l>(u)(<i>^(g)). (1.1.4) 

From this, by [11] 4.4] we conclude that for Z = another right-invariant vector 
field we have 

[$(u),$(n;)]o$^ =T$™o[y,Z]. 

Inserting e, this gives 

[4>(u),4>(w)]r„ =Te^m{\Y,Z]e) = Te^m{[v, w\r) 

(where we used that cf. II.I.IOT) . Thus, finally, [<i>(u), ^(w)] = 

‘h(['y,w]fi), as claimed. □ 

In the following result, we will start denoting the time variable in the flow map of a 
vector field with e, for the sake of compatibility with the case of symmetry groups 
of differential equations later on. 


1.1.13 Proposition. If M is T 2 , then every vector field $(ri) £ '31{G,M) is com¬ 
plete, with flow 

F 1 ^^’'^(to) = 4)(exp(eu), to) = exp(£z;) • to. 


In particular, 


4>(u)(to) 



4)(exp(£i;), to). 


(1.1.5) 


Proof. In the proof of ll.l.l2l we have seen that R" is 4>m-related to 4>(r)). Thus by 
[TTl 7.2] it follows that o Fl^ = FlJ^"^ o 4)^. Moreover, by [TT] 8 . 2 ], Fl^ ( 5 ) = 
exp(£z;) • g. Therefore, 

Flf^’'^(TO) = Flf^’'^ o 4)m(e) = $m(exp(£r;) • e) = 4)(exp(£v), to). 
Gompleteness of 4>(ti) is immediate from this description. □ 
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1.1.14 Example. Let —^-Mbea Lie transformation group with Lie 

group (M,+) and r 2 -nianifold M. Then is generated by ^’( 7 ^|g)- More- 

$( _d_ I ) 

over, the flow of ‘^(^Iq) is given by $ itself: Fb “ (m) = ^{e,m): this is 
immediate from ll.l.T!ll since exp(e ^|q) = e. 

Recall from m Sec. 16] that, given a Lie transformation group G x M ^ M, 
we set 

K := {k G G \ = idM}- 

Then K is a normal subgroup of G and m 16.3] demonstrates that the quotient 
group G/K acts effectively on M. Moreover, K is closed in G since we can write 
K = Hence by [TTl 21.6], if K is not open then the quotient 

manifold G/K is a Lie group. The exceptional case here is not very interesting 
though: 

1.1.15 Remark. If K is open, then 3i{G,M) = 0. Indeed, if K is open (and 
closed) then since e G K we must have Ge Q K. Then because for all u G g, 
exp(sv) G Ge it follows that <i>m(exp(eu)) = m for all m G M, and differentiating 
with respect to e at e = 0 implies that $(u)(m) = Te^rn{v) = 0 for all m. Thus 
<i)(u) = 0 and thereby 3l(G', M) = 0. 

1.1.16 Proposition. If K is not open then G/K acts effectively on M as a Lie 
transformation group and 'R{G/K,M) = 3?(G,M). 

Proof. Let 


: [G/K) X M ^ M 
{gK,m) 1 -^ 

To see that is well-defined and smooth let tt : G ^ G/K he the natural projec¬ 
tion. Then ttx Mm : GxM — {G/K) x M is a submersion (cf. [TTJ 20.5]) and $ is an 
invariant of the corresponding equivalence relation on G x M: if {gi, mi) ^ {g 2 , m 2 ) 
then giK = g 2 K and mi = m 2 , so $(gi,TOi) = {giK,mi) — {g 2 K,m 2 ) = 

^{g 2 ,m 2 )- Thus by [TTl 15.13], the map being the projection of $, is itself 
smooth. That $' gives an effective action has been shown in m 16.3]. 

Finally, for any m G M we have ° so for any w G g we get Te^m{v) = 

o Tf.Tr{v), i.e., <l>(u) = <i>'(re7r(u)). Since Tf-n is a surjection, this gives 
Jl{G/K,M) = Jl{G,M). □ 


1.1.17 Proposition. Let ^ be a transformation group that acts effectively on a 
T 2 -manifold M. Then the map 


$: gR^H{G,M) 

defined by (11.1.21) is a Lie algebra isomorphism. 

Proof. Using [T7l.l2l it only remains to show that $ is injective. Thus let <I>(u) = 0. 
Then bv ll.l.lSl it follows that exp(£u) • m = m for all m G M and all e G K.. Since 
G acts effectively we conclude that exp(£u) = e for all e, giving u = 0. □ 
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1.1.18 Example. Let G = GL(n,K.) and M = R" with $: {A,x) ^ A - x. To 
determine the map (11.1.2|) we let A''^ be the standard coordinates on GL(n,R) and 
x^ those on R". Then 



so $ = I—>■ ^ Since G act effectively, $ is an isomorphism. 

1.1.19 Remark. There is a strong converse of ll.l.lTl due to R. Palais (see [151 
p. 95]): If A is a finite-dimensional Lie algebra of complete veetor fields on a T 2 - 
manifold M, then there exists a connected Lie group G which acts effectively on M 
as a Lie transformation group and sueh that A = 3i(G, M). 

1.2 Orbits under a Lie transformation group 

From [111 Sec. 16] we know that any transformation group $ acting on a manifold 
M induces an equivalence relation on M: to ^ m' iff there exists some g € G with 
m' = gm. The equivalence class of any x € M is called the orbit of to under G. It 
is the range of the map = g ^{g, m), G —>■ M. 

1.2.1 Definition. Let G x M ^ M be a transformation group on a manifold 

M. For any m € M, the subgroup Gm ■= = {<? S G j g • to = to} is called 

the isotropy group of m. 

1.2.2 Remark, (i) Isotropy groups at equivalent points in M are conjugate sub¬ 
groups of G. In fact, suppose that to' = gm. Then {gGmg~^)'m' = to', so 
gGmg~^ c Gm’, and analogously g~^Gm'g Gm, so gGmg~^ = Gm'- 

(ii) For any to G M, the map : G ^ M projects to a map 'i’m ■ GjGm —>■ M 
defined by gGm '—>■ < 7 w. The range of is the orbit of to. Also, is injective: 
if '^migiGm) = ff'm( 52 Gm) then 51 TO = g 2 m, so gf^g2 G Gm, i-e., giGm = g2Gm- 

1.2.3 Remark. Let G x M M he a Lie transformation group on a T 2 - 
manifold M. Then any Gm is a closed subgroup of G. Hence by m 21-7], Gm is 
either discrete or it admits a unique structure as a (regular) submanifold of G. In the 
latter case it is also a Lie subgroup of G. If to' = gm then bv ll.2.2h i'). Gm is mapped 
onto Gm' by the diffeomorphism LgoRg-i. Thus the isotropy groups at points of an 
orbit are either all discrete or are regular submanifolds and Lie subgroups of G that 
are pairwise diffeomorphic (since Gm, Gm' are regular submanifolds, the restriction 
oi Lg o Rg-i is also a diffeomorphism from Gm onto Gm')- 

If Gm is open then it must be a union of connected components of G, which, 
by [TT] 2.4] are precisely the cosets of the normal subgroup Gg. If, for example, 
G = giGe.iI- ■ -iJgkGe and Gm = 5 iGeU- • -iJgiGe, then GjGm = {giGm, - - -, gkGm}, 
since for g G gjGe we have 



Also, the orbit of to only consists of finitely many points (namely 'I’m{G/Gm) = 
{51 - m,. - -, gk ■ to}). Otherwise, we have: 
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1.2.4 Theorem. Let Gx M ^ M be a transformation group on a T 2 -manifold 
M and let m £ M. If the isotropy group Gm of m is not open in G then the map 
'f'm from\L2f^ (ii) is an injective immersion of the quotient manifold GjGm Into 

M. 

Proof. Since o (with tt : C? —>■ G/Gm), '^m is the projection of the 

smooth map 4)^, hence is itself smooth (see m 1-1-9]). Also, GfGm is a quotient 
manifold of G by m 21.5]. Since is injective bv 11.2.21 liil. it remains to show 
that its rank in any point equals the dimension of C?/Gm- Since tt is a submersion, 
this is the case if and only if the rank of is everywhere equal to dim(G/Gm). 
We begin by showing that this is true at e. 

Let V £ TeG such that Te4>m(u) = 0. By (11.1.21) this means that the vector field $(?;) 
has a zero at m. Therefore II. 1.131 implies that F1 ^*'’'^(to) = exp(£i;)m = m for all s, 
i.e., exp(ez;) £ Gm for all e. Now if Gm is discrete then the image of e —>■ exp(eu), 
being connected, must consists solely of e £ G, so u = ^|gexp(e'(;) = 0. In this 
case, then, Te4>m is injective, and so the rank of T^^m equals the dimension of G, 
and thereby the dimension of GjGm- 

If Gm is non-discrete then by II. 2. 31 it is a regular submanifold of G. Hence e i—>■ 
exp(eu) is smooth as a map into Gm (see [TOl 1.1.14]), and so ?; = ^|jjexp(eu) £ 
TeGm- Altogether, we obtain that ker(re4’m) C TgGm- Conversely, ^m is constant 
on Gm, so Te^m\TeGm = 0, hencc in fact ker(Te4>m) = TeGm- Consequently, using 
m 21.7] we obtain 

rk(Te4>m) = dimG — dirnGm = dirnG/Gm. 

Finally, if g is an arbitrary point in G then 4>m o Rg = ^m', where m' = gm. Then 
since Rg is a diffeomorphism we have 

rkg($m) = rke($m/) = diiJiG/Gm'- 

Now bv ll.2.3I G.^ and Gm' are either both discrete or they are diffeomorphic, so we 
conclude that the rank of equals dim G/Gm for every m £ M. □ 


1.2.5 Corollary. Under the assumptions of \1.2.4[ the orbit G ■ m of any m G M 
can be endowed with the structure of an immersive submanifold of M diffeomorphic 
to G/ Gm- 

Proof. For clarity, we write ^m for 'I'm, viewed as a (bijective) map from GjGm 
to G • m. Declaring ^^m to be a diffeomorphism provides G • to with a differentiable 
structure with respect to which the inclusion map j : G • to ^ M is an immersion 
since j o ^m = 4'm : GjGm —>■ M is an immersion. □ 


1.2.6 Remark. Suppose that G is connected and let to £ M. Bv ll.2.4l the orbit 
G • TO can be discrete only if Gm is open. In this case, Gm is open and closed in G, 
so Gm = G and therefore G • to = {to}. 

1.2.7 Example. Let M = G = (M,-I-), and X = —ydx + xdy the rotation 
vector held on M. Then we obtain a Lie transformation group <I> by 

4>: (e, {x,y)) !->■ Fl^{x,y) = {xcose — ysine^xsine -F ycose). 

The orbits of $ are the regular submanifolds {x^ + = const} and the singleton 

1 ( 0 , 0 )}. 
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Recall that G is said to act transitively on M if for any m, m' G M there exists some 
g G G with gm = m!. Such a group action possesses only a single orbit, namely the 
manifold M itself. Bv ll.2.2l (i) this means that any di'm is a bijection from GjGm 
onto M. To further elaborate on this, we will need the following auxiliary result: 

1.2.8 Lemma. Let M"* and be manifolds and suppose that M is second count¬ 
able and that m < n. Then an immersion : M ^ N cannot be onto any open 
subset W of N. 

Proof. It suffices to take for W the domain of a chart % in N. Let p be any 
point in and choose charts {U,(p = ..., x™)) around p in M and 

{V, T] = {y^, ..., 2 /”)) around 4'(p) such that 4'(t7) C V C W and such that 77 o vp o 
79-1 = (x\...,x^) (x\...,x’",0,...,0). Then ??(«'([/)) C R™ x {0} C R", so 

it has Lebesgue measure 0. It follows that also the image x ('!'([/)) of this set under 
the smooth map x° Lebesgue measure 0. 

As p varies in 'I'“^(IT), the domains U cover the set As M is second 

countable we may extract a countable subcover {Uk | fc G N} from this collection. 
Then the sets ^{Uk) cover W fl entailing 

x(ITnvI/(M))= U x(^'(t^fe))- 
fcSN 

But then x{W fl has Lebesgue measure 0 and so it cannot be all of x{W). 

It follows that 'I'(M) cannot contain all of IT. □ 

Using this, we can prove: 

1.2.9 Proposition. Let G be a second countable Lie group that acts transitively 
as a Lie transformation group on the T 2 -manifold M. Then M is diffeomorphic to 
GjGm, for any m G M. 

Proof. Fix any m G G. If Gm is open, then so is any gGmi hence any point 
in GjGm (because TT~^{TT{g)) = gGm), which is therefore discrete (cf. [TTl Rem. 
20.1]). Otherwise, by [TTl 21.5] GjGm possesses a differentiable structure as a 
quotient manifold of G. In both cases, the quotient map tt : G —>■ GjGm is open 
and continuous, so also the topology of GjGm is second countable. Hence if GjGm 
were discrete it would be countable. But '^m is bijective, so this would imply that 
M was countable, which is impossible. Hence GjGm is not discrete, and so it is a 
quotient manifold of G with a countable basis for its topology. Also, bv ll.2.4l d^rr, is 
an injective immersion of GjGm into M. Hence dirnG/Gm < dimM. Since '1'^, is 
onto M. 11.2.81 implies that the dimensions in fact are equal. As : GjGm M is 
an immersion, it follows that its tangent map is bijective at any point. Thus by the 
inverse function theorem it is a local diffeomorphism, hence a global diffeomorphism 
since it is bijective. □ 


1.2.10 Corollary. Let G be a second countable Lie group that acts transitively and 
freely as a Lie transformation group on the T 2 -manifold M. Then M is diffeomor¬ 
phic to G. 

Proof. If G acts freely then Gm = {e} for every m G M. Hence GjGm = G and 
the result follows from 11 .2^ □ 


1.2.11 Corollary. Let G be a compact Lie group that acts transitively as a Lie 
transformation group on the T 2 -manifold M. Then M is compact. 










Proof. Since G is compact, it is second countable. By 11.2.91 M is diffeomorphic 
to GjGm, for any m G M. Let tt : G —?> GfGm be the quotient map. Then 
7 r(G) = GfGm is compact, hence so is M. □ 


1.2.12 Example. We continue our study of the action of 0(n,]R) on 5" ^ as a 
Lie transformation group from 11.1.61 This action is given by 


$: 0(n,R) X ^ 5'”"^ 

{A, x) !->■ Ax. 


For any vector v G 5""“^ we can find an orthogonal matrix T with v as its first 
column, so Tei = v. Hence $ has only a single orbit, S^~^, i.e., it is transitive. 
The isotropy group Gg^ of ei, i.e., those elements of 0{n, R) that leave ei unchanged, 
are the matrices of the form 


A = 



with D G 0(n — 1,R). By 11.2.9] we conclude that S'” ^ is diffeomorphic to the 
quotient manifold 0(n,R)/Gei, i.e., S”“^ = 0(n,R)/0(n — 1,R). 


1.3 Groups of transformations on a T 2 -manifold 

1.3.1 Definition. A setG of transformations on aT 2 -manifold M which is a group 
under composition (gi, 32 ) '—>■ 01 ° 92 (ffi, 92 G G) is called a group of transformations 
ofM. 

According to ll. 1. 11 the map <f>: {g,rn) 1 — g{m) = gm then defines a transformation 
group on M. Moreover, this action obviously is effective. In this section we want 
to analyze whether G can be endowed with a Lie group structure such that $ is 
smooth. The following example shows that such a structure may not be unique in 
general: 

1.3.2 Example. Let G be the group of translations on and let (pa '■= z ^ 
z + a. Then G is bijectively mapped onto by pa «, and this defines a C°°- 
structure on G, for any given C°°-structure on K^. But on R.^ there are different 
such structures, e.g., the standard one and the one from m Ex. 14.3]. Both of 
these induce corresponding structures on G such that $ is smooth. 

Recall from II.li^ that a complete vector field X on M induces an action of (R, +) 
on M as a Lie transformation group via the flow of X, := m 1 —>■ Fl^(m). We 
call the vector field X tangent to G if all the resulting transformations belong 
to G. 

1.3.3 Definition. A group of transformations on a T 2 -manifold M is called a Lie 
group of transformations if it admits a Lie group structure such that 

(i) The map $: {g,m) 1 —>■ gm is smooth. 

(ii) If X € X{M) is complete and tangent to G then the group homomorphism 
e I—^ is a one-parameter subgroup of G. 

The above is the desired property entailing uniqueness: 
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1.3.4 Proposition. A group of transformations of a T 2 -manifold M admits at 
most one structure as a Lie group of transformations of M. 

Proof. Let G and G" be two Lie group structures on a group of transformations 
that both satisfy 11.3.31 We have to show that id : G —^ G' is a diffeomorphism. 
We first note that bv 11.3.31 fib G is a Lie transformation group on M, denoted by 
G X M ^ M. Hence by 11.1.131 any v G TgG defines a complete vector field 
$(^) € Xj M) with Flf( to) = exp(en)TO. It follows that $(?;) is tangent to G. 
Now ll.3.^ fiil implies that e i—?> idoexp(£n) : R —?> G' is a one-parameter subgroup of 
G'. By [m 8.3] there is a unique v' G TgG' such that idoexp(e'(;) = exp'(en') (with 
exp' the exponential map of G'). Thereby we obtain a well-defined map v ^ v' 
from TeG to TgG'. 

Fixing a basis (ui,... , n„) of TeG, let (/? = {x^,..., x") be the corresponding canon¬ 
ical chart of the second kind, cf. m Rem. 8.5]. Then since id is a group homomor¬ 
phism, 

id o (exp(x^'(;i)... exp(x"nn)) = exp'(x^x()... exp(x”u(j), 

implying that id is smooth on the domain of ip. Smoothness at an arbitrary point 
of G then follows by writing id = o id o Lg-i. Interchanging G and G' in the 
above argument shows that id in fact is a diffeomorphism. □ 


1.3.5 Remark. Bv 11.3.31 (i), any Lie group G of transformations is also a Lie 
transformation group that acts effectively on M. Therefore, 11.1.171 implies that 
1R(G, M) is Lie algebra-isomorphic to qr. 

1.3.6 Proposition. If a group G of transformations of a T 2 -manifold M admits 
the structure of a Lie group of transformations then the setXt{M) of complete vector 
fields on M tangent to G is a finite-dimensional Lie algebra (namely 'R{G,M)). 

Proof. Denote the action of G on M by $: G x M —>■ M. Bv ll.l.llll every <I>(n) G 
iR{G,M) is complete with corresponding transformations : to i—>■ Flf^’'^(TO) = 
exp(en)TO, hence belonging to G. Thus 4)(n) is a complete vector field on M that 
is tangent to G, i.e., 51{G,M) C Xt{M). 

Conversely, let X G Xt{M) and let tfe := to i—>■ FI^(to) be the corresponding 
transformation of M. Bv 11.3.31 (ii), e >->• is a one-parameter subgroup of G, 
so by [m 8.3] there is a unique v G TeG with ^'^(to) = exp(eu)TO for all to G M. 
Thus the vector fields X and 4)(n) have the same maximal integral curves and hence 
coincide, so X G lk(G, M). □ 


1.3.7 Remark. There is in fact also a converse to 11.3.61 see [TSl p. 103]: Let G 
be a group of transformations of a T 2 -manifold M. If Xt{M) generates a finite¬ 
dimensional Lie algebra A of vector fields on M then G admits the structure of a 
Lie group of transformations of M and A = fk(G, M). 
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Chapter 2 


Integrability of distributions 
of non-constant rank 

2.1 Distributions of non-constant rank 

In m 17.32] we proved the classical Frobenius theorem on the integrability of 
distributions of constant rank: 

2.1.1 Theorem. Let M be an n-dimensional T 2 -manifold and let A be a k- 
dimensional distribution on M. Then the following are equivalent: 

(i) A is involutive. 

(a) Every point in M lies in the domain of a flat chart ip = (a;^,..., x”) for 
A, i.e., such that ■ ■ ■ ,dxk\m forms a basis of A(m) for each m in the 

domain of ip. 

(Hi) Every point of M is contained in an integral manifold of A. 

(iv) Every point m of M lies in a cubic chart {ip = (x^,... (p(!7) = 

[—c, cj" centered around m such that the slices Ua = x {®}) integral 

manifolds of A. If M' is a connected integral-manifold of A with M' C U 
then M' is contained in one such slice. 

Note that in m we called a distribution integrable if it satisfies (ii). However, in 
this chapter we will use (iii) instead, cf. 12.1.171 below. 

For the application to orbits of (local) transformation groups we have in mind, 
I2.1.1l is too restrictive in that it requires A to have constant rank (i.e., dimension) k 
everywhere. In this chapter we closely follow H. Sussmann’s article [H], as well as 
P. Michor’s exposition in [T3] to develop a theory of integrability for distributions 
of non-constant rank. 

Throughout this chapter, all manifolds are supposed to be T 2 and paracompact. 

2.1.2 Definition. We call 

Xioc(M) := |J{X(t7) \ UCM open} 
the space of local vector fields on M. 
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If X, V are local vector fields on M, then so is [X, Y], defined on the intersection of 
the domains of X and Y. We agree to consider the ‘empty vector field’ an element 
of Xioc(fW), to avoid having to add formulations like ‘if the domains of X and Y 
intersect’, and similar for further local notions to be introduced below. 

For any X G X\oc{M) we denote the maximal domain of Fl^ in R x M by Ux- Ux 
is an open subset of K x M (see O 2.5.17]). For each t G R, Fl^ is a diffeomorphism 
of some maximal open set Ut{X) (which may be empty) onto some open set U[{X). 
Note that U[{X) = U-t{X). 

For any n > 1 in N, any ^ = (Xi,..., X„) G Xioc(M)"-, any T = (<i,..., f„) G R", 
and any m G M we set 

^Tim) := (Fl^=(... Flf„'‘(m)...)). (2.1.1) 

The maximal domain of (T, m) i—>■ {m) then is an open subset of R" x M that 
will be denoted by and we let Ut{^) be the set of all m G M such that ^t(w) 
is defined, i.e., Ut{^) = {m G M | {T,m) G U^}. 

2.1.3 Definition. A diffeomorphism f'U U' between open subsets of M is 
ealled a loeal diffeomorphism. 

If fi '■ Ui ^ U' {i = 1 , 2 ) are local diffeomorphisms, then so is /i o / 2 , with domain 
ff^{Ui) and range /i( 172 nl 7 i). Moreover, ff^ : —)> Ui is a local diffeomorphism 
as well. A group of local diffeomorphisms is a set G of local diffeomorphisms that 
is closed under composition and inverses. 

Our main examples of local diffeomorphisms are flows of local vector fields. For any 
X G Xioc(Af) the set of all Flf' (t G R) is called the group of local diffeomorphisms 
generated by X, and is denoted by Gx- More generally, if D is any subset of 
Xioc(Af) then there exists a smallest group of local diffeomorphisms containing 
lJ{Gx I X G D}. This group (more precisely, pseudogroup) will be denoted by 
Gd- It is called the group of local diffeomorphisms generated by D. By definition, 
we have 

= {^T I 3n : e e T G R”}. (2.1.2) 

Given finite tuples A = (Ai,..., Am), /r = {pi ,..., Pk), by Xp we denote their con¬ 
catenation (Ai,..., Xm, Pi, ■ ■ ■ ,Pk), and by A we denote the reverse tuple (Am, • ■ •, Ai) 
With these notations we have 

fTPT' '■= f,T o Pt' = iiv)TT' and (Ct)~^ = C_f ■ 

A subset D of Xioc(Af) is called everywhere defined if the union of all domains 
of elements of D covers M. An analogous definition applies to groups of local 
diffeomorphisms. 

2.1.4 Definition. Let G be an everywhere defined group of loeal diffeomorphisms. 
Two elements mi, m 2 of M are ealled G-equivalent if there exists some g G G sueh 
that g{mi) = m 2 . The equivalenee classes of the resulting equivalenee relation on M 
are ealled the orbits of G. If D Q Xioc(Af) is everywhere defined then the Go-orbits 
are also called D-orbits. The orbit of m G M is called trivial if it equals {m}. 

Thus mi, m 2 belong to the same D-orbit if and only if there exists some n > 1, 
some ^ G D" and some T G R" such that ^t(wi) = m 2 . From this we immediately 
conclude: 

2.1.5 Lemma. Two points mi, m 2 belong to the same D-orbit if and only if there 

exists a piecewise smooth curve 7 : [a, 6 ] —>■ M such that 7 (a) = mi, 7 ( 6 ) = m 2 , 
with the following property: There exist a = to < ti <■■■< tr = b and Xi G D 
(i = 1,... ,r) sueh that, for each i, is an integral curve of Xi or of —Xi. 
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Such a curve 7 will be called a piecewise integral curve of D. 

2.1.6 Remark. We now want to endow the orbits of Zl with a natural topology. To 

this end, for any m G M and ^ G ZZ", let '.= T ^ and let C K" 

be its domain. Then the Zl-orbit Sm of m is the union of all the images of the 
mappings We equip S := Sm with the finest topology such that each p^^m 

(n > 1, ^ G ZZ") is continuous. Since each p^^m is continuous for the trace topology 
of M on S it follows that the topology of S is finer than the trace topology, i.e., 
S' ^ M is continuous. In particular, S is T 2 . In general, the topology on S will be 
strictly finer than the trace topology. Since all are connected and their images 
all contain m, S is connected. 

2.1.7 Remark. The topology on S does not depend on the choice of m G S. To 
see this, denote by Sm the set S equipped with the topology induced by the maps 
p^^m- By symmetry, it will suffice to show that id : Sm Sm' is continuous for all 
m, m' G S. Pick rj, Tq such that ryTb(m') = m. Then p^^m = T ^ ^TPToijn'), which 
is the composition of T —>■ TTg and p^n,m'■ Now p^n,m' is continuous into Sm', so 
p^ m '■ U^^m —>■ Sm' is continuous. By the universal property of the finest topology, 
the claim follows. 

We next generalize the definition (ffH Def. 17.1]) of distribution to the variable 
rank setting: 

2.1.8 Definition. A distribution on a manifold M is a mapping A that assigns to 
every m G M a linear subspaee A(m) ofTmM. A set D of local vector fields is said 
to span A if, for every m G M, A(m) = span{X{m) \ X G ZZ}. 

If ZZ C Xioc(M) is everywhere defined then it spans a unique distribution, which will 
be denoted by A^. Any distribution which is of the form A £> for some everywhere 
defined family ZZ C Xioc(ZVZ) is called smooth. 

We say that a local vector field A on M belongs to a distribution A if A(m) G A(m) 
for every m in the domain of X. Let 

Da '■= {X G Xioc(ZVZ) I X belongs to A}. 

Then A is if and only if A is spanned by Da- 

A distribution A is called invariant under a group of local diffeomorphisms G if 

Vm G M : Tmg{X{m)) C A(g(m)) (2.1.3) 

for all 5 G G such that m is in the domain of g. In this case also Tgmg~^ maps 
A{g{m)) into A(m), i.e., Tmg{A{m)) = A{g{m)). It follows that the dimension of 
A (to) is the same for all points to in the same G-orbit. 

A distribution Ai is said to be contained in a distribution A 2 if Ai(to) C A 2 (to) 
for all TO G M. If A is a distribution and G is a group of local diffeomorphisms 
on M then there is a smallest distribution A® which contains A and is G-invariant 
(namely the intersection of all such distributions). More precisely, A^(to) is the 
linear span of all vectors v G TmM such that v G Aim) or v = Tm>g(w) for some 
g G G and m' G M with w G A(to') and to = gm'. 

2.1.9 Remark. Let A be spanned by ZZ C Xioc(ZIZ). Then A® is spanned by the 
family 

ZZ U {g*A I A G ZZ, g G G s.t. 5 *A is defined }. 

It follows that if A is C°° then so is A*^. 
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Note that li G = Go then in fact ISp’^ is spanned by 

V ■= {g<,X \ X £ D, g £ G s.t. g^X is dehned } (2-1.4) 

(i.e., the union with D is not required). To see this, note that any X G D with 
domain, say, U can be written as X = for g = Fljf = idj/ G Go- If H C 
XiociM) then a Gu-invariant distribution is called G-invariant. The smallest D- 
invariant distribution which contains A is denoted by A^, i.e., 

A^ := A*^®. 

Next, let D be an everywhere defined subset of X\qc{M). Then we set 

Pd := Aj’:= (Ad)^ . (2.1.5) 

Thus Pd is the smallest distribution that is G-invariant and contains Ad- Bv l2.1.9l 
Pd is smooth and the dimension of PDirn) depends only on the H-orbit of m. 

2.1.10 Remark. It is important to note that the H-orbits are precisely the Pd- 
orbits: to see this, by (12.1.211 and the remark following 12.1.41 it suffices to note that 
if 5 = G Gd and X G D, then X g*X, and therefore (by [TTl 17.8]), 

Fir^=eToFlf oCt. 

Hence the flows of the do not alter the H-orbits. 

Thus the following definition makes sense: 

2.1.11 Definition. Let S be an orbit of an everywhere defined subset D ofXiodM). 
For any mG S, the dimension of Pd (m) is called the rank of S. 

Trivial orbits are characterized by the following result: 

2.1.12 Lemma. Let D C X\oc{M) he everywhere defined and let S be the orbit of 
m G M. The following are equivalent: 

(i) PD{rn) = {0}. 

(ii) The orbit of m is trivial, i.e., S = {m}. 

Proof. (i)=>(ii): By (12.1.41) . for any X G D and g G Gd such that g^:X is defined, 
g^:X{m) = 0. Therefore, the flow of any element of Pd leaves m unchanged. 
(ii)=>(i): Suppose that PD{m) d {0}) then by (12.1.41) there would exist some g G Gd 
and some X G D with g^,X{m) d 0- But then the flow of g„X would leave {m}, 
contradicting (ii). □ 


2.1.13 Example. As in 11.2.71 let M = and X := —ydx + xdy. Now set 
D := {X}. Then the D-orbit of any (xo,yo) is a circle through (xo,?/o) with center 
(0, 0), while the orbit at (0, 0) is trivial. In this example, Pd = Ad'- In fact, this is 
always true when D consists of only one vector field X because in that case 

Fi(Fif o Flf o Fldt = Flf, 

so X and (F1)^)*X have the same flow and therefore coincide. 
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2.1.14 Definition. A set D C X\oc{M) is called involutive if for any X,Y G 
XiociM) that belong to D also [^,1^] belongs to D. If D is any subset of Xioc{M) 
then the smallest involutive subset of XiodM) that contains D will be denoted by 
D*. A smooth distribution A is called involutive if the corresponding set is 
involutive. 

2.1.15 Lemma. Let D be an everywhere defined subset of XiodM). Then 

Ad Q Ad* Q Pd- 

Proof. The first inclusion is immediate since D C D*. The second inclusion follows 
since Pd is involutive by 12.2.41 (iv) below. □ 


2.1.16 Example. Clearly the first inclusion in l2.1.15] can be proper. This example 
shows that the same may happen for the second inclusion. Let M = Xi := 

X 2 = where 4>{x, y) = dd) s-nd d is b, smooth function with tpd) = 0 for a: < 0 
and ifd) > 0 for a: > 0. Let D {Xi, A 2 }. Then since D{x,y) has dimension 2 
for a; > 0, the same is true for Pd Ad- Moreover, any point in can be joined 
to a point {x, y) with a; > 0 by a piecewise integral curve of D, so in fact Pd has 
dimension 2 everywhere. However, for a; < 0 the distribution Ad* has dimension 1. 

2.1.17 Definition. An immersive submanifold S of M is called an integral mani¬ 
fold of a distribution A on M if, for all s G S, we have TgS = A(s). A C°°- 
distribution A is said to be integrable, or to have the integral manifold property if, 
for every m G M, either the orbit of m is trivial or there exists an integral manifold 
S of A such that m G S. 

Note that we notationally suppress the inclusion map j : S ^ M here. If a C°°- 
distribution A is integrable then a smooth vector field X belongs to A if and only if 
X is tangent to every integral manifold of A. By El 17.16], any such X can locally 
be viewed as a smooth vector field on any given integral manifold. Moreover, by 
[TTl 17.22] and l 2 . 1 .I^ we have: 

2.1.18 Lemma. Any integrable -distribution is involutive. 

Note, however, that the converse is not true in the present situation (contrary to 
constant rank distributions!). In fact. 12.1.161 provides an example of a distribution 
Ad that is involutive: 


[Xi,X2]{x,y) = d'{x)dy = 

but cannot have the integral manifold property (the dimension of the integral man¬ 
ifolds at a: = 0 would have to be 1 and 2). 

2.1.19 Definition. Let A be a smooth distribution on M. A maximal integral 
manifold of A is a connected immersive submanifold S of M such that 

(i) S is an integral manifold of A, and 

(a) every connected integral submanifold of A that intersects S is an open sub¬ 
manifold of A. 


ip{x) 


X 2 X > 0 

0 X <0 
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A is said to have the maximal integral manifold property if through each point of 
M with Pn^{m) ^0 (cf. \ 2.1.1 2\) there passes a maximal integral manifold of A. 

In particular, any two maximal integral submanifolds through the same point m 
must coincide. 

For the discussion below we will need the following auxiliary result: 


2.1.20 Lemma. Let X, Y be smooth local vector fields on M. Then for any m in 
the intersection of the domains of X,Y we have: 


[X,Y]im) 


fL 

dt 


Firv*(Fl^v7(Fl^(Fl^(r^)))) 


Proof. For any local smooth function / we have 

^(Flf) V = X(/) o Flf = {FlfriLxf) = (Flf )*(T/(A)). (2.1.6) 

Since 


dt 


/(Fl!:v*(Fl^y*(Fl^(Fl^(m))))) = Tf 





and Tf{[X,Y]){m) = {[X,Y]{f)){m), the claim will follow if we can show that for 
any such / we have 


d 

dt 


((Fl^)*(Fl^)*(FF^)*(Fir^)*/) (m) = ([A,r](/))(m). 


Using (I2.1.6|) we obtain 




= ((Fl^)*Lx((Fl^)*(Fl5^)*(Fir^)7) 

+ (Fl^)*(FQ*Fv((FF^)*(Fir^)7) 
-(Fl^)*(Fl^)*(Fl5v*)*Fx((Firv*)V) 


-(Fl^)*(Fl^)*(Fl5vf)*(Fl!:vf)*(^v/)) 


1 gjVi) 

2-\/i 2\/t 


We need to calculate the limit as t \ 0 of this expression. Now since ( 7 ( 0 ) = 0 it 
follows that 


limA^ 

t\0 2y/t 



Again using (12.1.61) we calculate: 


g'{0) =[LxLx + Lx{Ly — Lx — Ly) + {Lx + Ly)Ly + Ly{—Lx — Ly) 

— {Lx + Ly — Lx)Lx + LxLy — LxLy — LyLy + LxLy + LyLy^f 
= ‘^{LxLy - LYLx)f = 2[X,Y]f, 


giving the claim. 


□ 
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2.1.21 Remark. To clarify the geometric meaning of A^. and Pd, and also to 
motivate the structure of the main results in the following section, suppose that a 
subset D of Xioc{M) is given and that we want to find a distribution A with the 
property that the orbits of D are precisely the maximal integral manifolds of A. 
It is then geometrically natural to define A (to) as the set of all tangent vectors of 
smooth curves that pass through to and lie entirely in the U-orbit of to. Call this 
set of curves r^- 

For any X G D, the integral curve t !->■ Flf'( to) belongs to Fm- Consequently, A(to) 
must contain (m) = X{m). Moreover, for X,YgD the curve 

f^Fir^(FF^(Fl^(Fl^(TO)))) 

belongs to Tm- Bv l2.1.201 the derivative of this curve at t = 0 is [X,Y]{m), which 
therefore must also he in A(to). Iterating this procedure it follows that Ad* must 
be contained in A. 

There may, however, be further vectors beside those in AD*(jn) that have to be 
contained in A(to): Let X G gM. and set to' := F1^(/(to). If 7 is a smooth 

curve with 7 ( 0 ) = to' and 7 G F^' then the curve a := t 1 -^ Fl^( 7 (t)) belongs to 
Tm- Setting v := 7 '( 0 ) it follows that cr'(O) = Tm'Fl^(u). We conclude that for 
any v G A(to') we must have T^'FI^T (v) G A(to), i.e., A must be T)-invariant (i.e., 
Gu-invariant). 

These considerations suggest to define A as the smallest Z?-invariant distribution 
that contains Ad*- We shall see below that this distribution coincides with the 
smallest I?-invariant distribution that contains Ad, i.e., with Pd- 

The above also explains why A^* by itself may be too small to serve our purpose: 
it may not contain sufficiently many directions: one may move within the orbit of 
TO along an integral curve of some X G D, catch up a new direction there and come 
back. Only Pd is large enough to also contain these directions. 


2.2 Orbit structure and integrability 

Throughout this section, let D be an everywhere defined subset of XiodM), and let 
S be an orbit of D (cf. 12.1.121) . We equip S with the natural topology introduced 
in 12.1.61 and we will use the notations introduced there. We set 

D°° ■= IJ D", 

nGN>o 

the set of all finite tuples of elements of D. If ^ G then is open in R" and 
P^,m '- U^^rn M, T I—>■ ^t(w) is C°°. Also, for any m G S, S is 

continuous. 

Given f G D°°, to G M, and T G U^^m, we set 

V{^,m,T) := TTP(_,miTTU^^rn), 

the image of the tangent space of at T under the tangent map of p^^m- Setting 
Wo := ^r(w), V(^,m,T) is a linear subspace of Tm^M- 

2.2.1 Lemma. Let ^ G D°°, m G S, T G U^^m, and toq := ^t(w). Then 


V{^,m,T) C Pd(too). 
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Proof. To begin with, let n = 1, ^ = X and T = to. Then = {t £ M | 
3Flj^(m)} and TtgU^^m = R- Also, p^,m(t) = Flf'(w), so setting m' := Fl^(m) we 
have TtgP^^m = X{m') and therefore 

V(X,m,to) =span(A(m')) C Aoim') C Pjo(m'). 

Suppose now that the claim is already true for n — 1. Let ^ and T £ U^^m- 

Then we may write ^ = Ary, T = toT' for suitable X G D and ry £ T' £ Urj^m 

and to £ K.. By definition, V{^,m,T) = im{TTp^.m)- Here, we have p^^rn{to,T') = 
F^fgiPv,m{T')), so that 

T{to,T')Pi,m = X (Fl^^ {prj^m{T ))) © Tp^^^(7'/)F1(^ (Tr'Prj.m) ^ ^ 

= Xi^Tim)) (BTp^^^iT')Flfg{TT>Pr,,m). 

Therefore, H(^,to,T) C span(A(^T(TO))) + ^(r/)Fl^(H(?y, m, T')). By our in¬ 

duction assumption, V{p,m,T') C Pjy(px'i'm)), and since Pd is P-invariant we 
obtain 

Tp,MT')Kinv,rn,r)) C Poi^Tim)). 

Moreover, A(^t(w)) £ Ad(Ct(w)) C PD(^T(m-)), so altogether V{^,m,T) C 
Poi^Ti'm)), concluding the proof for n. □ 


2.2.2 Lemma. Let mo £ S. Then there exist ^ £ D°°, m G S, and T G such 
that ^t(w) = Wo and m, T) = Pjy^mo). 

Proof. We will see that the claim follows once we establish the following two 
statements: 

(i) If 5, ry £ P°°, m, m' G S, T G U^^m, and T' G Urj.m' are such that Ct(w) = 
PT'{m') = mo then there exist a G D°°, m" G S, and T" G Ucr,m" such that 

V{^, TO, T) U H(ry, to', T') C V{a, to", T"). 

(ii) There exists a finite subset A of Pd(too) that spans Pd(too) and satisfies: for 
every v G A there exist ^ £ D°°, m G S, and T G such that ^t(w) = toq 
and V £ V{^,m,T). 

In fact, let us suppose that (i) and (ii) have already been established. By (ii), any 
element of Pd(too) is a linear combination of the elements of A =: {ui,..., Vk} and 
there exist ^i, mi, Ti (1 < i < k) such that Vi G V(^i,mi,Ti) and ^Ti(mi) = mg 
for all i. By (i), then, there exists one V(^,m,T) containing all V(^i,mi,Ti), so 
Pd(too) C V{^,m,T). Together with I2.2.'T] this finishes the proof. It therefore only 
remains to show (i) and (ii). 

To see (i), set to" := to', a := ^|ry, and T" := T{—T)T'. Then 

p<y,ra"{T") = cjT"{m") = (C_f (w'))) = r]T'{m') = mo- 

Analogously to (12. 2. Ill we may therefore split the tangent map of Pa,m" at T" into 
a direct sum of three maps, corresponding to differentiation with respect to T, —T, 
and T', respectively. Therefore V{a,m",T") = im{TT" p,T,m") contains the sum of 
the images of these partial maps. Since = to, the first of these maps 

is TTP^^m, so V{^,m,T) = im.{TTp^^m) C V{a,m",T"). Moreover, since ^t^-t is 
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the identity, the third of these maps has image miiTT'Pr^^m') = w',T'), so also 

V{r],m',T') C V{a,m",T”). This proves (i). 

To see (ii), let A be the set of all vectors Y{mQ), where Y is of the form g^,X for 
some X G D and some g G Go- By (12.1.4L A spans Pd (mo). Since dim Pd (mo) < 
dim(M) we may extract a finite subset A from A that still spans Pd (mo). Given 
any vGAwe have v = Tmg{w), where g G Gd, m G S, g(rn) = mo and w G TmM 
is of the form w = X{m) for some X G D. Also, since g G Gd we have g — for 
some ^ G D°° and some T G U^^m- 

Now set T] := ^X and T' := (T, 0). Then /^^'(m) = ^rim) = mo, so Pn^miT') = 
Pt' {m) = o Fljf (m), and splitting the tangent map as above we find 

TT'Pr],m = TTP^^m © Tm^T{X{m)). 

In particular, v = Tm^T{X{m)) G \m{TT'p-q,m) = F(?7,m,r'), which gives (ii). □ 


2.2.3 Lemma. If a connected integral manifold N of Pd intersects S then N is 
an open subset of S (in the topology introduced in \2.1.6\) . 

Proof. Note first that if N intersects S in, say, m then S must be non-trivial by 
12.1.121 because 1 < dimT^A^ and TmN = PD{m). Let T) as in (12.1.41) . then from 
12.1.101 we know that the P-orbits are precisely the P-orbits. 

By (|2.1.4|l . for any m G N there exist Xi,... ,Xp G V such that {Ai(m),..., Xp{m)} 
form a basis for PD{ra) = TmN. In particular, p = dim(A). Let 

^:W ^ N 

(P,...,tp)^Fl^^oFli^^o...oFi;^'’(m). 

Then $ is a diffeomorphism from an open neighborhood of 0 in onto a neighbor¬ 
hood of m in N: In fact, since Xi,. ■ ■, Xp are tangential to N it follows from m 
17.16] that their restrictions to N can be viewed as vector fields on N, and since 
9i$(0) = Xi(m), the inverse function theorem gives the claim. 

From what was said above it follows that every point in the image of $ lies in the 
same P-orbit as m. Thus any point in N has a neighborhood that is contained 
in one orbit of P. Since N is connected, it follows that N is contained in a single 
P-orbit S (given ni, n 2 G N we may connect them by a smooth curve 7 in N. Now 
covering 7 by neighborhoods as above it follows that the entire curve lies in the 
same orbit). Consequently, if D 5” ^ 0 for some P-orbit S, then N C S. 

It remains to show that N (as a set) is open in S. Bv l2.1.6l we need to see that for 
any m G S and f G P" (n G N), p'^m(^) open in K". Thus let T = (p,..., tn) G 
U^,m Q R" be such that p^^m{T) G N. Since f = (Ai,..., A„) G P", it follows that 

p^,„(r)=Fl^^oFli^^o...oFl^'‘(m). 

Note that G P C P for all i, so as in our considerations concerning $ above 
we may view the restrictions to N of the Xi as local vector fields on N. Thus for 
1 < i < n, Aijjv G Xioc(A). As in (12.1.11) it follows from this that the maximal 
domain of the map K." x A —>■ A, (T, m') 1—>■ (T) is open in R" x A, so in 

particular there exists an open neighborhood of T in R" which is mapped by /5{,m 
into A. We conclude that p^m(A) is open in R", as claimed. □ 

After these preparations we are now ready to prove the first main result of this 
section: 
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2.2.4 Theorem. Let D C X\ociM) be everywhere defined. Then 


(i) If S is a non-trivial orbit of D, then S, equipped with the tovoloav from A2.1~6[ 
admits a unique -structure such that S becomes an immersive submanifold 
of M. The dimension of S equals the rank of S (see \2.1.TT\) . 

(ii) With the topology and differentiable structure from (i), each non-trivial orbit 
of D is a maximal integral submanifold of Pjj. In fact, the non-trivial orbits 
of D are exactly the maximal integral submanifolds of Pd . 

(Hi) Pd has the maximal integral manifold property. 

(iv) Pd is involutive. 

Proof. We note first that (iii) is immediate from (ii), and that (iv) follows from 
(iii) via 12.1.1^ 

(i) Let mo G S. Then bv 12.2.21 there exist m G S, f G D°°, and T G such that 
^t(w) = mo and V{f,m,T) = PD(mo). It follows that 

k := rk(S') = dim(Pi:i(mo)) = dim(P(^, m, T)) = dim(im(TT/95,m)), 

so rkT(p{,m) = k. As we noted before 1^1.Ill k = dim(PD(p)) for any p G S. 
Therefore, if T' G then since \m{TT'p^^m) = V{(,,m,T') C PD{p^,m{T')) by 
12.2.11 it follows that rk.{TT'p^^m) < k, i.e., the rank of cannot exceed k anywhere 
in 

On the other hand, the rank of p^,m locally can only increase, so by the rank theorem 
i pm 1.1.3]) there exist open neighborhoods U of T in K" and V of mo in M and 
diffeomorphisms ip : U ^ t{U) Q R” (where n is such that ( G H"), <p(T) = 0 
Ip : V —>■ ip(V) C (with I = dim(M)), ip{mo) = 0, such that the following 
diagram commutes: 

U F 

p (2.2.2) 

(-1,1)" (-1,1)' 

Here, in,i,k is the map 

(Xi, . . . , Xn) H, (Xi, . . . , Xfe, 0,..., 0). 

l-k 

Denote by N the (regular) submanifold V’~‘'(*n.i,/c((—1,1)")) of M. Then, as a set, 
N = p^^ra{U). liT' gU and m' := p^^rn{T'), then since (12.2.21) commutes we obtain 

T^.N = TV>-i(vi,fe(Tr-(^(M”))) = Tt-p^,„(R") = V{f, m, T'). 

Bv l2.2.11 V{f,m,T') C PDpm') and since both spaces have dimension k they must 
be equal. Thus N is an integral manifold of Pd- By definition of N C S, so 
12.2.31 shows that, as a set, N is open in S. In the same way, 12.2.31 applies to any 
open connected (in the natural manifold topology of N) subset of N. As these sets 
form a basis of the manifold topology of N, the natural inclusion map I : N ^ S 
is an open map. Moreover, due to (12.2.21) I can be decomposed as 


U p^^^(u)=N F 



(- 1 , 1 )” ^- (- 1 , 1 )' 
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Here, I' is the inclusion map and (reversing the roles of I and n above) 


k,n,k = (a^l, •.. ,Xi) (xi, . . . ,Xfc,0, 


, 0 ) 


n—k 


(Note that k < min(?,n) by the above). Hence 

I = Pi,m O O ii^n,k O-lpO r 

By definition of the topology on S', is continuous as a map into S, so also I 
is. We conclude that / is a homeomorphism onto its image and that this image is 
open in S. 

Denote by Af the family of all manifolds N constructed as above. Also, let set(A^) 
denote the underlying set of the manifold N. By the above, {set(A^) | N G J\f} is 
an open cover of S and each I : N ^ S is a, homeomorphism onto its image. This 
provides a family of differentiable structures on the elements of an open cover of S. 
Our aim is to define a differentiable structure on S such that each N G Af becomes 
an open submanifold of S. To this end it suffices to show that for any Ni,N2 G Af 
the differentiable structures of set(A^i)nset(A^ 2 ) as an open submanifold of Ni resp. 
N 2 coincide. Call these manifolds Wi, W 2 and let j : Wi —>■ W 2 be the identity. 
By symmetry we only have to show that j is C°°. Since both ii : Wi ^ M and 
*2 : H 2 ^ M are immersions, by [M 1.1.8] to see this it suffices to show that j is 
continuous. This, however, is immediate since both ii and 12 are homeomorphisms 
onto the same open subset of S and j = o ii. 

It follows that S possesses a structure of a smooth manifold whose natural manifold 
topology is precisely the topology from l2.1.^ Also, since each N as above is an open 
submanifold of M it follows that S itself is an immersive submanifold of M which 
is an integral manifold of Pd- Suppose that S' is another such smooth structure 
on S. Then i : S ^ S' is a homeomorphism and both S ^ M and S' ^ M 
are immersions, so again by [101 l-l-S] i is a diffeomorphism. Hence the smooth 
structure on S is unique. 

(ii) That S is an integral manifold of Pd was already shown in (i). In particular, 
dim(S') = k = dim(PD(TOo)). Also, S is connected bv 12 . 1.61 Now let R be any 
connected integral manifold of Pd with Rfi S ^ By 12 . 2.31 set(i?) is an open 
subset of S. As we did in the proof of (i) for S, we may apply [1231 to each open 
connected subset of R to see that the inclusion map R ^ S is an open map. Denote 
by R' the open submanifold of S with underlying set set(i?) and let / : i?' —^ i? be 
the identity map. Then by what we have just shown, I is continuous. Note that 
both iji ■ R ^ M and j/j' : R' ^ M are immersions, and iR/ = iro I. Therefore, 
again by [TUI l-l-S] it follows that I is smooth, and is in fact an immersion. Since 
both R and i?', being integral manifolds of Pd, have the same dimension k, I is even 
a local diffeomorphism and, due to its injectivity, a diffeomorphism. We conclude 
that R = R' as a manifold, i.e., R is an open submanifold of S. Thus S is indeed 
a maximal integral manifold of Pd, as claimed. Finally, let m G M and let P be a 
maximal integral manifold of Pd containing m. Then by what we have just shown, 
R is contained in the P-orbit S of m. Since, conversely, S' is a connected integral 
manifold of Pd and R is maximal, we in fact have R = S. Thus the maximal 
integral manifolds of Pd are precisely the orbits of P. □ 

The second main result is as follows: 

2.2.5 Theorem. Let A be a smooth distribution on M and let D C X\oc{M) 
span A (in particular, D is everywhere defined). Then the following statements are 
equivalent: 
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(i) A is integrable. 

(ii) A has the maximal integral manifold property. 

(Hi) A is D-invariant. 

(iv) For every A G I?, i S R, and m € M such that Flj^(m) is defined, 

T^Flf (A(m)) C A(Flf (m)). 

(v) For every m G M there exists k > 1 and elements X^,... ,X^ of D such that 

(a) A(m) = span(A^(m),..., A''(to)), and 

(b) for every X G D there exists some e > 0 and smooth functions f) : 
(—e,e) —7> K (1 <i,j < k) such that 

k 

[A,A*](Flf (m)) = ^/;(i)X^(Flf (m)) 
i=i 

for i = 1,... ,k and t G (—e, s). 

(vi) A = Pd. 

Proof. Clearly, (iv)=>(iii)=>(vi), and (ii)=4»(i). Also, (vi)=>(ii) follows from 12.2!^ 

(i)=>(v): Let m G M. If Poim) = 0 then we may pick any X^ G D. Then 
X^ljn) = 0, and (v) (a) holds trivially. Next, lei X G D and set f) = 0. By (i) also 
[A, A®] G D, so [A, A®](m) G A(m) = {0}. Also, X{m) = 0, so Fl^{m) = m for all 
t and also (v) (b) is satisfied trivially. 

So let PD(m) ^ 0 and let S be an integral manifold of A through m. Also, let 
A^,..., A^ be elements of D such that {A^(to), ..., X^{m)} is a basis of A(m) = 
TmS. Then k = dim(S') and the restriction of the A® to S can be viewed as 
elements of Xioc(*S'). By continuity of the determinant it follows that there exists 
a neighborhood U of to in S' such that {A^(to'), ..., A^(to')} is a basis of Tm'S = 
A(to') for all m' G U. li X G D then A is tangent to S, hence also restricts to a 
local vector field on S. It follows that also all [A, A®] are tangent to S, and so they 
must be smooth linear combinations of the A® on U. This implies (v) once we pick 
e > 0 so small that the curve 1 1 —>■ FI^(to) remains in U. 

(v)=>(iv): By the flow property, it suffices to show (iv) for t G (—e, e). For any such 
t, let W^{t) := (F1^)*(A®)(to). Then by [TTJ 17.7] we get 

(7IT® _ ^ 

= (Flf)*(L;,A®)(TO) = (Flf)*([A,A®])(TO) =^/®(t)(Flf)*(A^KTO). 

Hence ..., satisfy the system of linear ODE 

on (—e, e). Since {IF^(O),..., W^{Q)} forms a basis of A(to), the same is therefore 
true for {W^{t ),..., W^{t)} for any t G (—e, e). Now since A® G D, 

T^Flf (IT®(t)) = A®(Flf (to)) G A(Flf (to)), 

so indeed T^Flf {A{m)) C A(F1^(to)). □ 

Next, we want to connect the above results to the approach taken in [T3]. To do 
this, we need to introduce some notions first. 
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2.2.6 Definition. D C X\oc{M) is called stable if for all X,Y G D we have 

(Fi^)*y g ^ such that this expression is defined. A local vector field X on 

M is called an infinitesimal automorphism of a distribution A if TmFl^(A{m)) C 
A(Flf‘(TO)) whenever defined. The set of infinitesimal automorphisms of A is de¬ 
noted by aut(A). 

For any D C XiodM), let V C XiodM) be as in (12.1.41) (for G = Go)- 

2.2.7 Lemma. Let D C Xioc(iW)- Then 

(i) T) is the smallest stable subset of XiodM) containing D. 

(ii) D is stable if and only if'D = D. 

(in) If D is stable then Pd = Ad- 

Proof, (i) By (12.1.41) . D C V. Moreover, V is stable: if g^X, hPT S T> (for 
9 = £,T,h = Vt' S Gd and X,Y € D) then bv l2.1.10l we obtain 

(Flf = (Ct o Flf o ^_f)dh.Y) = {g_f, o o Flf o ^_^)*Y S V. 

On the other hand, ii D' D is stable then given X G D and g = for 

T = {ti,...,tn) then {F\dd)*X G D) so (Fid^fd)*^ ^ ^ leading 

to g^X = (FidD* ... (FidddX G D', i.e., V C D'. 

(ii) is immediate from (i), and (hi) follows since A-p = Pd by (12.1.41) . □ 

After these preparations, we have (see [131 3.24]): 

2.2.8 Theorem. Let M be a smooth manifold and let A be a smooth distribution 
on M. Then the following statements are equivalent: 

(i) A is integrable. 

(ii) Da is stable. 

(Hi) There exists a subset D C XiodM) such that T) spans A (i.e., s.t. Pd = A), 
(iv) aut(A) (~l Da spans A. 

Proof, (i)o(iii): this is 12.2.51 (i)o(vi), applied to D := Da. 

(i)^(ii): LetA, y G Da and m G M. Then y(Fl(^(m)) G A(Fl^(m)), and so[2?23] 

(i) =>(iv), applied to D = Da gives 

(Flf )*y(m) = TFlffy(Flf (m))) C A(m). 

Thus (Flf )*y G Da. 

(ii) =>(iii): Set D := Da, then the claim follows from l2.2?7l (ii). 

(i)=>(iv): By 12.2.51 (i)=>(iv), applied to Da, it follows that Da C aut(A). Thus 
aut(A) n Da = Da, which spans A by definition. 

(iv)=>(iii): Set D := aut(A) (~l Da. Then D spans A by assumption, and for 
X G D and Y G Da we have TFlfj(y(Flf (m))) G A(m) since X G ant (A), 
so (Flf)*y G Da. Iterating this argument it follows that for n G N, A^, F G D 
(1 < z < n), ^ = (Ai,..., A„) andT G K”, (^T)*b" G Da. Therefore, D C D C Da, 
so T> spans A and the claim follows. □ 

Next, following [131 3.25], we want to analyze the local structure of the integral 
manifolds of an integrable distribution. For this we first introduce an important 
class of immersive submanifolds: 
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2.2.9 Definition. Let A be any subset of , and for m G A denote by Cm.{A) the 

set of all points in A that can be joined to m by a smooth cwrvfli] in M lying in A. 
A subset N of M is called an initial submanifold of M of dimension n, if for each 
m G N there exists a chart of M centered at m such that 

^m(Cra{U^ H N)) = <p^{U^) (R" X {0}) C X (2.2.3) 

In order to see that calling such subsets submanifolds is justified, we need an aux¬ 
iliary result: 

2.2.10 Lemma. Any piecewise smooth curve c : [a, &] —>■ M admits a reparametriza- 
tion as a C°°-curve. 

Proof. Let a = t^ < ti < ■ ■ ■ < tk = h he such that each curve is smooth. 

For each i pick a smooth map (fi : [a, 6] —R such that 4>i(t) = 0 for t < ti-i, 
4>i(t) = 1 for t > ti, and (j) is strictly increasing on [ti-i,ti]- Then the map 

k 

(/) := <0 + ~ 

i=l 

is smooth, strictly increasing on [a, &], (j){[a,b]) = [a, &], and — 0 

for all i and all m > 1. Thus co cf is the desired reparametrization. □ 


2.2.11 Lemma. Under the assumptions of \2.2.!A let mi,m 2 G N and Umi, Um 2 
be as in (12.2.31) . Then H N) D Cm 2 {Um 2 ^ ^)) is open in R” x {0}. 

Proof. We may suppose that the intersection is nonempty, so let p G Cmi {Umi H 
N)nCm 2 iU^ 2 ^N). Then 

w := Cp(u^, n c/„, n A^) c (u^^ nN)n n n) 

In fact, there is a smooth curve ci from mi to p in Umi bl N and if q G IF then 
there is also a smooth curve C 2 from p to g in Umi H Um 2 ^ ^■ The concatenation 
of Cl and C 2 can be reparametrized smoothly bv l2.2.10l so q G Cmi{Umi H N), and 
analogously for m 2 ■ We claim that 

Pmi(lT) = nc/^jn (R" X {0})). (2.2.4) 

C: ifmiiw) c ifmi{Cmi{Umi H N)) = ipmiiUmi) H (R” X {0}) and Pmi(IF) c 
TmiiUmi n C/ma)- Moreover, any q G IF is connected to p within IF by a smooth 
curve c, and so ipmi o c connects the corresponding images. 

A: Let c be a smooth curve from ipmi(p) to pmi {q) in 'Pmi{UmiUUm 2 )Ui^^ x {0}). 
Then c := o c is a smooth curve in Umi Um 2 (d N from p to q. 

The right hand side of (12.2.41) is precisely the connected component of (pmi (p) in 
TmiiUmi n C/ma) d (I^" X {0}) in R" X {0}, hence is open. Since p was arbitrary, 
this shows that pmi {Cmi {Umi d IV) n Cm 2 {Umi d IV)) is open, as claimed. □ 


2.2.12 Theorem. Let N be an initial submanifold of dimension n of MK Then 
there is a unique C°^ -structure on N such that the inclusion map i : N M becomes 
an injective immersion and such that the following universal property holds: For any 
manifold and any map f:R^ N, f is smooth if and only if io f: R ^ M is 
smooth. Moreover, N is paracompact. 

^By a smooth curve here we simply mean a C°°-map from some interval into M. 
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Proof. For any m G N, bv l2.2.9l there exists a chart {Um, <Pm) of M around m such 
that ipm{Cm{Um H N)) = ifmiUm) H (R" X {0}). We define a chart for iV at m by 
{CmiUmr\N),')prn ■= ^m\c^(u,nnN))■ Then the chart transition functions V’mi o 
are the restrictions of the smooth transition functions (pmi o ^ to subsets 

of R" X {0} that are open bv l2.2.1fl hence we obtain a differentiable structure on 
N. From this choice of charts it immediately follows that i is an immersion. Thus 
N is an immersive submanifold of M. In particular, the natural manifold topology 
of N is finer than the trace topology of M on TV. 

To show (the non-trivial part of) the universal property, let /: N he such that 

j o / : i? —>• M is smooth, let r G i? and choose a chart {U, tf) at fir) in M such 
that (p{Cf(r)iU n N)) = (p{U) n (R” X {0}). Then since f~\U) is open in R we 
may pick a chart {V^ip) in i? at r with V C f~^{U) such that 'f{V) is a ball in 
Mf. Then since V is C°“-contractible and i o f is smooth it follows that any point 
in f{V) can be connected by a smooth curve in M that lies entirely in 17 fl fV with 
/(r). Therefore, f{V) C Cf(^r){U fl iV), and we can write 

{^\Cf(^){unN)) ofoip-^ = ^ofo ■0-1. 

Here, the right hand side is smooth by assumption and the left hand side is the 
chart representation of / around r, as a map from R to N, so indeed /: R ^ N is 
C°°. 

To see uniqueness, denote by N' another differentiable structure on N with the 
universal property. Then id : fV —>■ M and id : —>■ M are smooth, hence so are 

id : fV —^ and id : A^' —>■ N, i.e., the smooth structures in fact coincide. 

Concerning the paracompactness of N, note that any connected component of N 
is contained in a connected component of M, hence is second countable by m 
14.9]. Alternatively, M can be equipped with a Riemannian metric g (simply by 
gluing local Riemannian metrics on charts via a partition of unity) and this induces 
a Riemannian metric i*g on N. The claim then follows from [71 Satz 1.1.2]. □ 

Next we show that also a converse of the previous theorem holds. 

2.2.13 Theorem. Let /: —>■ be an injective immersion between manifolds 

which has the universal property from \2.2.12l i.e.: if h: P ^ N is smooth and 
h{P) C /(M), then the induced map h: P ^ M with f o h = h is smooth. Then 
f{M) is an initial submanifold of N. 

Proof. By JlOl 1.1.3], given any m G M, there exist charts (</?, IT) centered at m 
in M and (0, V) centered at f{m) in N such that 

(■0 0/0 (/ 3 -I) = i := ,..., x^) !->■ (a;!,..., a;^, 0,..., 0) G 

We may assume that W = f~^(V), so i : (/?(1F) —t f’iV). Pick r > 0 so small that 
{a; G R* 1 Ja:] < 2r} C (/?(1F) and {?/ G R* ] \y\ < 2r} C ■0(17). Define B’jiO) resp. 
H(,(0) to be the open ball of radius r in R^ resp. R*, and set 

U := 0-\h'(O)) C N, Wi := ip-^(B^(0)) C M 

We show that (0 := ■01c/, t/) satisfies (12.2.31) . In fact, since 

■•■,0) 1 \y\ < r})) = {(?/0 ...,/, 0,..., 0) ] \y\ < r}ni{ip{W)) 

= {( 2 /\---,/, 0 , ...,0) 1 \y\ < r}, 

we have 

0-1(0(17) n (R'= X {0})) = 0-i({(2/i,..., 0,..., 0) 1 1 j/ 1 < r}) 

= / o(^-i((0o/o(^-i)-i({(j/i,...,y'=,O, ...,0) 1 \y\ < r})) 

= /o(^-i(i3(:(0)) = /(lTi). 
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Now ^ o f{Wi) = i(B^(0)) C (0), so f{Wi) cun f{M). Also, f{Wi) is C°°- 
contractible, so altogether we get 

n X {0})) = f{Wi) c C/(™)(C7 n /(m)). 

Conversely, let n G Cf(^m)iU n/(M)). This means that there exists a smooth curve 
c : [0,1] —>■ with c(0) = f{m), c(l) = n, and c([0,l]) C [/ fl f{M). By the 
universal property of / it follows that the unique curve c : [0,1] —>■ M with c = foe 
is smooth. 

We show that c([0, 1]) C Wi. Since c([0, 1]) C f~^{U) C f~^(y) = W and c(0) = 
m (since / is injective), it follows that (/9(c(0)) = 0. Thus if c([0,1]) % W\, by 
continuity of c there must exist some t G (0,1] where tp o c intersects dB^{0), i.e., 
with c(t) G But then 

V'O ficit)) G V'o/O ip-\dB^,{0)) = lidB^m c dBliO), 

and so if o c{t) = if o f o c{t) ^ S),(0), i.e., c{t) ^U, a contradiction. We conclude 
that c([0, 1]) C Wi, and therefore n = f(c(l)) G f(Wi). Altogether, 

C/m(c/ n f(M)) = f(Wi) = if-\if{U) n (K" X {0})), 

which shows that f{M) is an initial submanifold of N. □ 

The previous result in particular applies to the situation where f = j ■ M ^ N, 
i.e., where M is an immersive submanifold with the universal property. In fact, 
the setup of I2.2.1d1 is only seemingly more general: if /: M —is an injective 
immersion, then by transporting the manifold structure of M to f{M) via /, i.e., 
by declaring f: M ^ fi^) to be a diffeomorphism it follows that j : f{M) ^ N 
becomes an immersion, hence f{M) turns into an immersive submanifold of N, and 
the universal property from 12.2.131 translates into the one from 12.2.121 

We now return to the study of integrable distributions. 

2.2.14 Theorem. Let A be an integrable distribution on MK Let S be a non-trivial 
orbit of Ha and let m € S. Then there exists a cubic chart (U, ip = (a:i,..., xf)) 
centered at m, (p{U) = (—e, ef for some e > 0, some k > 1 and a countable set 
A C such that 

ip{U n 5) = {a: G p>iU) \ ,..., a:*) G A}. 

If the distribution is of constant rank k then the above holds for every non-trivial 
orbit intersecting U, with the same k. Moreover, each non-trivial orbit is an initial 
submanifold of M. 


Proof. Let k := dim(5') and pick Xi,... ,Xk G Ha such that {Xi{m), ..., Xk{m)} 
is a basis of A(m). Next, choose a chart (x = (j/^, ■ • ■, y’’), W) around m in M such 


that Ai(m),.. .,Xk{m), 



is a basis of TmM. Let 


f{t^ 


t^) := (Fl^i o ... o Fl^'=)(x-'(0,..., 0, t'^+\ ..., t')). 


Then / is a diffeomorphism from some neighborhood of 0 G onto a neighborhood 
of m in M, and we take ip := f~^ on a suitable neighborhood U of m, for which 
we may suppose that p{U) is a cube (—e,e)* with center 0 = (p{m). Since S is an 
orbit of Ha, 


m' o • • • o Fl^‘’(m') G S 
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for all m' and where the right hand side is defined. Therefore, for any 

... ,t'-) € U we have 

m' = f{t\ /(O,..., 0, t’^+\ (2.2.5) 

This means that C/ fl S' is the disjoint union of connected sets of the form 

Uc ■■= {to' e U I a;'‘+^(TO') = Cfc+i,.. .,x\m') = c;} 

where c = (cfc+i, ■. ■ ,ci) is constant. By assumption, A is integrable, so 12.2.51 and 
12.2.41 show that any orbit is a maximal integral manifold of A. Therefore, since 

;7, = {(F1^^ o...oFl^'')(x-i(0,c)) I G (-£,£)"}, 

and S is an integral manifold of A, the proof of 12.2.51 demonstrates that Uc is 
an open (and connected) submanifold of S. Now S, being a connected immersive 
submanifold, is contained in a connected component C of M, and C is second 
countable since M is paracompact. Thus by El 14.9], S is itself second countable. 
This shows that there can at most be countably many Uc as above. If A is of 
constant rank k then clearly the above construction works for this same k for any 
orbit that intersects U. 

Finally, from (12.2.511 it follows that to' G Cm{U n S) if and only if m' € Uq, i.e., 

ifiCmiu n s)) = <f{u) n (r'' x {o}), 

so S is indeed an initial submanifold of M. □ 


2.2.15 Remark. The previous result provides an alternative proof of 12.1.11 (iii) 
^ (iv). 

2.2.16 Definition. A chart as in^KKJ^is called a distinguished chart for A. The 
connected components of U D S are called plaques (or slices). 

2.2.17 Definition. Let D C Xioc{M) be everywhere defined. D is said to satisfy 
the reachability condition if the D-orbits are exactly the connected components of 
M. 

A necessary and sufficient condition for reachability is given in the following result: 


2.2.18 Theorem. Let D C Xioc(Af) be everywhere defined. Then the following 
are equivalent: 

(i) D satisfies the reachability condition. 

(ii) For every m £ M we have dunP£)(m) = dim(M). 

Proof. Let n := dim(M). 

(i) =>(ii) Suppose that for some to G M we have k := dimPD{m) < n. Then 
('by 12.2.41 ('i'l'l the orbit S oi D through to is a fc-dimensional connected immersive 
submanifold of M. It follows that the interior of S in the topology of M is empty. 
But any connected component of M is open, so S cannot be such a connected 
component, contradicting our assumption (i). 

(ii) ^(i) By m 14.1], every maximal integral manifold of Pd is open (being an 
immersive submanifold of the same dimension as M), and connected. Also, M is 
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the disjoint union of the orbits of D which, bv 12.2.41 (ii), are exactly the maximal 
integral manifolds of Pd- Thus these orbits are the connected components of M. □ 

Recall from 12.1.141 that if D is any subset of Xioc(-M) then the smallest involutive 
subset of Xioc(M) that contains D is denoted by D*. 

2.2.19 Corollary. With M, n, D as in \2.2.1^ if Ad* has dimension n for every 
m € M then D satisfies the reachability condition. 

Proof. Bv 12.1.151 Ad* C Pq, so dim(PD(m)) = n for all m £ M, and the claim 
follows from l2.2.181 □ 

To conclude this chapter we show how the classical results on the integrability of 
constant rank distributions can be derived from the results established above. 

2.2.20 Definition. Let D C Xioc{M) he everywhere defined. D is called locally of 

finite type if for every m G M there exist X ^,..., £ D such that 

(i) A£)(to) = span(X^(m),..., X^(m)), and 

(ii) for every X G D there exists a neighborhood U of m in M and smooth func¬ 
tions f) £ C°°{U) (\ <i^j < k) such that 

k 

[X,X^]{m')=Y,f]{m')X\m') 

f=i 


for j = 1,..., fc and all m' G U. 

2.2.21 Theorem. If D is locally of finite type then Ad has the maximal integral 
manifold property. 

Proof. By assumption, A = Ad satisfies (v), and thereby also (ii) of l2.2.5l □ 


2.2.22 Corollary. If D* is locally of finite type then Ad* = Pd o-nd the non-trivial 
D-orbits are precisely the maximal integral manifolds of Ad* ■ 

Proof. We know from 12.1.151 that Ad* C Pd. Bv I2.2.2T1 Ad* has the maximal 
integral manifold property, so 12.2.51 shows that Ad* is {D*-, hence) D-invariant. 
Therefore by definition of Pd we get Pd C Ad*, i.e., Ad* = Pd- The claim then 
follows from 12.2.41 (ii). □ 

We can now derive the following version of the classical Frobenius theorem. 

2.2.23 Theorem. (Frobenius) Let A be a smooth distribution on M of constant 
rank k. Then the following are equivalent: 

(i) A has the maximal integral manifold property. 

(ii) A is involutive. 

Proof. (i)=>(ii) This is 12.1.181 

(ii)=>(i) Let m G M and pick X^,... ,X^ £ A such that {X^{m),... ,X^{m)} is 
a basis of A{m). Then also {X^{m'),... ,X^{m')} is linearly independent for all 
m' in some neighborhood U of m in M. Since dim(A(?7i')) = k for all m', any 
local vector field on U that belongs to A is a linear combination of A^,..., with 
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smooth coefficients. If X belongs to A then since A is involutive, [X, A*] G A, and 
so Ha is locally of finite type. Since = A by definition, (i) follows from l2.2.21l 

□ 


2.2.24 Corollary. Let D C X\oc{M) be everywhere defined, involutive and of 
constant rank. Then A]j has the maximal integral property. 

Proof. By 12.2.251 it suffices to show that A^ is involutive. Let m G M and pick 
X ^,..., G A such that {A^(to'), ..., A^(m')} is linearly independent for all m' 
in some neighborhood {7 of m in M. If A, A' G Ad on U then both A and A' are 
linear combinations of A^,..., A^ with smooth coefficients. But then also [A, A'] 
is such a linear combination of the X^ and of brackets of A^ and X\ which also 
belong to Ad due to the involutivity of D. Thus [A, A'] G Ad- □ 


2.2.25 Remark. Collecting some of the results proved above we obtain an inde¬ 
pendent proof of the classical Frobenius theorem 12.1.11 In fact, (iii)<J4>(iv) follows 
from l2.2.15l 12.2.231 shows that (i)=^>(iii), and (iii)=^(i) is clear. (ii)=^(i) is the easy 
part of [TTJ 17.13]. 

Finally, to see that (iv)=J>(ii), let {ip, U) be a chart as in (iv). Then for any point m 
in U, dj;i \m,. ■., d,^k\m span the tangent space at m of the slice Ua containing m. 
But Ua is an integral manifold of A, so this tangent space equals A(m). 
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Chapter 3 


Symmetry groups of 
differential equations 

3.1 Local transformation groups 

In Chapter [T] we studied Lie transformation groups on differentiable manifolds. Such 
group actions are always defined globally ($: G x M —>• M). For the applications 
to symmetry groups of differential equations we have in mind, the natural actions 
will typically not be defined globally, however. In this section, following [niiii], 
we therefore study local transformation groups by means of the tools developed in 
Chapter [21 Throughout, we will assume M to be a connected paracompact (hence 
Hausdorff and second countable) C°“-manifold. 

3.1.1 Definition. A local transformation group (or local Lie transformation group) 
on M consists of a Lie group G, an open subset U of G x M with {e} x M Q 14, 
and a smooth map 14 ^ M such that 

(i) If [h, m) € U, {g, d)(/i, m)) £ lA and {gh, m) £ U then 

$( 5 ,$(/i,m)) = <^{gh,m). 

(ii) For all m £ M, $(e,m) = m. 

(Hi) If {g,m) €U then {g~^,^{g,m)) €14 and (by (i), (ii)) 

= m. 

We will often abbreviate $(g,m) by g ■ m. We set 

lAg ■.= {m € M \ {g, m) €14} {g € G) 

I4m-.= {g €G \ {g, m) €14} {m € M) 


and 


^g-. I4g ^ M, m !->■ $(g, to) 

^rn-l4m^M, g 1-^ $(g, to) 

For 14 = G X M we obtain a global transformation group as in 11.1.21 
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3.1.2 Definition. A subset % ^ S Q M is called an orbit of the local transformation 
group $: W C G x M —>■ M, or a ^-orbit, if it is a minimal G-invariant subset of 
M, i.e., 

(i) Ifm€S,g€G and {g,m) G U then ^{g,m) G S. 

(a) If S' C S is another subset of M satisfying (i) then either S" = 0 or S' = S. 
For any m G M we denote by S,f,^rn the orbit of m under 

3.1.3 Remark. If $ is a global transformation group then 

S^,,m = $m(G) = G • m = {$(g, m) \ g G G}. 

For a local transformation group U ^ M we obtain 

= {p G M \ 3k G N, 3g^ G G {I < i < k) : gk G Km, ^ 

gt G . g^.m (1 < * < fc - 1) and gi . gk - m=p} 

In what follows, we will illustrate many of the concepts we consider in the following 
example: 

3.1.4 Example. Let M = G = (M, +), and 

^e,ix,y)) := 

\ 1 — ea; 1 — ea; / 

The natural domain of $ is 

U = {(e, (a;, y)) I £ < i for X > 0, £ > — for x < 0, £ G R for x = 0} C R x R^, 

X X 

which is an open subset of G x M, and $ is C°° on U. One easily checks that 
$(£l, $(£ 2 , (x, y))) = $(£i + £ 2 , (x, y)), 

whenever both sides are defined. Thus we obtain a local transformation group. Its 
orbits are the points on the y-axis and the straight half-rays emanating from the 
origin (except for the positive and negative y-axis). Thus they are either single 
points or (regular) submanifolds of M. 

Note that $ cannot be realized as the restriction to U of some global Lie transfor¬ 
mation group on R^: in fact, for any x ^ 0 we have |$(£, (x,i/))| —>■ oo as £ —>■ i. 

3.1.5 Definition. A local transformation group GxMAU^M is called 
(i) connected, if 

(a) M and G are connected. 

(b) lA is connected. 

(c) Urn is connected for each m G M. 

(a) semi-regular, if all orbits can be endowed with a smooth structure as immersive 
submanifolds of M of the same dimension. 

(Hi) regular, if it is semi-regular and every m G M possesses a neighborhood basis 
of open sets U such that for every orbit S of G the set U D S is connected in 
S. 
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Blanket assumption: From now on we will assume all local transformation groups 
to be connected in the above sense. 

3.1.6 Remark. 

(i) Since S (being a manifold) is locally pathwise connected, a semi-regular trans¬ 
formation group is regular if and only if every m € M possesses a neighbor¬ 
hood basis of open sets U such that for every orbit S' of G the set [/ fl S is 
pathwise connected in S. 

(ii) By |1()1 1.1.12], every orbit of a regular transformation group is a regular 
submanifold of M. 

3.1.7 Example, (i) The map from 13. 1 !4l defines a regular transformation group 

on \ {(0,?/) I 2 / G R}. 

(ii) (Cf. [TTl Ex. 18.7]) Let M = x he the two-dimensional torus and 

G = (K, -f). Fixing w G K and using angular coordinates {9, p) on M we set 

^{e,{9, p)) := {9 + e, p + cue) mod 27r. 

Then the orbits of $ are immersive submanifolds of dimension 1, so G acts semi¬ 
regular ly on M. If ct) G Q then the orbits are closed curves and $ acts regularly. 
However, if cu is irrational then the orbits are dense in M and therefore cannot be 
regular submanifolds by m 14.1]. So in this case $ does not act regularly on M. 

Our next aim is to show that the (non-discrete) orbits of any local transformation 
group can naturally be endowed with the structure of an immersive (indeed even 
initial) submanifold of M. 

3.1.8 Remark. If<I>:GxM —^ Misa global (connected) Lie transformation 
group on M then we have already shown this in 11.2.41 and 11.2.51 Denoting by Gm 
the isotropy group of m in G we have bv 11.2.21 that the map 4'^ : GjGm G ■ m, 
gGm '—>> gm is a bijection. If the isotropy group Gm of m is open then bv ll.2.6l the 
orbit of m is the singleton {m}. Otherwise, declaring 'hm to be a diffeomorphism 
we may endow G • m with a smooth structure as an immersive submanifold of M. 
The fact that G • m is in fact an initial submanifold of M will follow from 13.1.141 
and 13.1.151 below. 

Turning now to the case of local transformation groups, we need to come up with a 
different construction since in general Gm will no longer be a subgroup of G in this 
case: in fact, if 51 , 32 G Gm then in general we will not have that (g]"^ ■ g 2 ,m) G G. 
But this fact was used in ll.2.2l to obtain injectivity of 4'^. 

The route we will take to finding a smooth structure on the orbits of G goes via 
the infinitesimal generators of the action of 4>. We first note that also for a local 
transformation group we may introduce the definitions of $(u) and fk(G, M) exactly 
as in (ll.l.2p . However, ‘R{G,M) C Xioc(TL) now. Also, as in 11.1.12] it follows that 
the map 4>: u 1 —>■ 4>(u) is a Lie algebra homomorphism from onto 3?(G,M), the 
Killing algebra of 4>. Concerning [TTTT2| we have to be careful about the domain Um 
and note that for a local transformation group the vector field 4>(u) need no longer 
be complete: 

3.1.9 Proposition. Let U ^ M be a local transformation group, and let m G 
M and u G g. Then for all e such that exp(ew) G Um we have 

Fl*^’'^(m) = $(exp(eu), to) = exp(eu) • to. 
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In particular, 


<i>(exp(£z;), m). 


$(w)(to) 


d 


de 0 


Proof. For e in a sufficiently small interval around 0, exp(£u) G Um- On any such 
interval (hence on the maximal such interval), we can argue as in the proof of ll.1.131 

□ 


For any to G M we set 


:= {$(i;)(to) | u G g}. 

Any HjniGjM) is a linear subspace of T^M. 

3.1.10 Example. For the example from 13.1^ we have 

®(4|o)(a;,y) = To<P(,,^y){de\o) = ^elo + 4|o dy 

= x^dx + xydy =: X{x,y). 

Therefore, 'Ji(^x,y){G, M) is one-dimensional if a; ^ 0 and equals {0} for a: = 0. 

To verify [T71.13I in this example (cf. 11.1.1411 we first calculate the flow of X. The 
integral curve c(e) = {x{e),y{e)) of X through (a;o,?/o) has to satisfy the following 
initial value problem: 

dex{s) = a;(£)^ 
d^y{e) = x{e)y{e) 

(a;(0),y(0)) = (xo^yo)- 

Indeed we obtain 

(a:(e),y(e)) = {-r ^—) = $(e, (a:o, 2/o)) 

\ 1 — ea;o 1 — £a:o / 

with maximal domain (—oo, X) jf > q, (^,oo) if xg < 0, and K, respectively, 
if xq — 0, i.e., the maximal domain of c is precisely lI{xo,yQ)- la general, one can 
only expect that U{xQ,ya) is contained in the domain of the corresponding maximal 
integral curve. Also, the image of c is the orbit of (a;o,yo)- 

3.1.11 Proposition. Let GxM^U^M be a local transformation group. 
Then the Killing algebra 1R(G, M) of $ spans an integrable distribution 

A$ := 

on M. 

Proof. By definition (see ()1.1.3p l. tR.(G,M) consists of the smooth local vector 
fields $(n) for n G g, hence A$ is a smooth distribution on M. Pick any basis 
{ui,...,r!fc} of g and set Xi := $(ni) G tR{G,M) for 1 < i < fc. Then bv 11.1.121 
{Xi,... ,Xk} is a Lie algebra of local vector fields spanning A$. Consequently, A$ 
satisfies condition (v) of 12. 2. 51 and thereby is integrable. □ 

Note that the dimension of A$ may vary from point to point, so we really need the 
theory of distributions of non-constant rank from Chapter [2] From [ITl. Ill it follows 
bv 12. 2. 41 and 12. 2. that the non-trivial orbits SAq,,m of A$ are initial submanifolds 
of M. It therefore remains to show that these orbits in fact coincide with those 
introduced in 13.1.21 i.e., that Sg,^m = For this we need some auxiliary 

results. 
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3.1.12 Lemma. Let U ^ M be a local transformation group and as in \S.l.l\ 
for m G M let Um = {5 G G | (g, m) GU} {m G M). Then for any g G Um there 
exist n G N and vi,... ,Vn G Q, ti,... ,tn G K>o sueh that 

(i) g = exp(ti'(;i).exp(t„t)„) 

(a) Vs G [0,t„] : exp(sz)„) G Um 

(Hi) Vi = 1,..., n - 1 Vs G [0, U] : exp(s'(;i) exp(tj+i?;i+i)... exp(t„z;„) G Um- 

Proof. Denote by >V the set of all g G Um that satisfy (i)-(iii). Then W ^ 0 
since e G W. To show that W is open, let g G W and pick an absolutely convex 
neighborhood 1/ of 0 in g with exp{V)g C Um- Then exp(y)(ji C W: given h G 
exp{V)g we can write 


h = exp(u) exp(tiui).exp(t„u„), 

and since V is absolutely convex it follows that also exp(to) exp(tir;i).exp(t„ti„) G 

Um for all t G [0,1]. But also Um \ W is open: suppose that g G Um \ W and again 
choose an absolutely convex neighborhood F of 0 in g with exp(y )5 C Um- Suppose 
that (exp(t/)g) nW 7 ^ 0, then there exists some v gV such that h := exp(?;)g G >V. 
Since exp(P)g C Um we also have exp{tv)g G Um for all |t| < 1. It follows that 

gt ■-= exp{t{-v))h = exp((l - t)v)g GUm {t G [0,1]). 

This, however, shows that g = gi is an element of W, contradicting our assumption. 
Therefore, exp{V)g C Um\yV, implying that Um\yV is open. Since Um is connected 
by our blanket assumption following 13.1.51 W = Um - ^ 


3.1.13 Lemma. Let U M he a local transformation group, let m G M, 
g G Um and pick n G N, wi,... ,u„ G g, < 1 ,... G M>o such that (i)-(iii) of \3.1.12\ 
are satisfied. Define curves 


{1 = 1 , 2 ) 


for s G [0, tn] by 

7i(s) := exp(su„)m and 72(5) := Flf G") 
and for s G EJ=i+i tj > E"=i tj] {i = 1, - - - ,n - 1) by 


n 

7 i(s) := ^exp (^(^s - ^ exp(ti+iVi+i)... exp(t„v„)'j 

j=i+l 

72(.) := Flf«. , (Fl*<7‘l, F,«-)(„),,,,), 

Then 71(5) = 72(3) for all s G [ 0 ,X]r=i 


m 


Proof. For s G [0,t„] this is immediate from 13.1.121 and 13.1.91 Proceeding by 
induction, suppose that we already know that 71 = 72 on [ 0 ,X]”=i+i tj]- Now set 

pi{s) := {exp{svi) exp{ti+iVi+i) .. .exp(t„u„)) • m 
P2(s) := FlfG.)(Fi^K+i)(,. .Flfj’'")(m))...). 
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Then it suffices to show that pi = p 2 on We do this by showing that pi satis¬ 

fies the same initial value problem as p 2 on [0, ti]. Set exp(ti_|_iUi+i)... exp(t„nn) =: 
h. By [ni 8.2], Rh{eyip{svi)) = Flf ' {h), so 

^{Rh{eyip{svi))) = i?"“(i??i(exp(swi))). 
as 

Using this, together with (jl.l.4p . we calculate 

= ^$m(-R/i(exp(sWi))) = T^rniRT (.Rh{exp{sVi)))) 
as as 

= ^{vi){^rn{Rh{exp{sVt)))) = ^{Vt){pi{s)), 

which is precisely the defining ODE for p 2 . Also, the initial values coincide since 
pi(0) = P 2 ( 0 ) by our inductive assumption. □ 

After these preparations we may now show the equality of the two kinds of orbits 
we introduced above: 

3.1.14 Proposition. Let GxM^U^M be a loeal transformation group 
and let m G M. Then 

S^,m — 5^AcE.,m- 

In particular, each non-trivial orbit is an initial submanifold of M. 

Proof. Since G is connected, once we choose a basis {ni,..., Vk} of g, any element 
of G is a product of certain exp(eiVi). Thus bv l3.1.13l it follows that 5$,™ is trivial 
if and only if $(?;i)(m) = 0 for all i, i.e., if and only if A$(m) = 0. Since A$ is 
integrable bv l3. 1.11112. 2. Sl fvil gives that A$ = P%(g,m) (recall that A<i, = A^(^g,m) 
by definition). Consequently. E~1.12l shows that the trivial orbits of and S'A^.m 
coincide. 

Turning now to the case of non-trivial orbits, let m' G and let g G Um'- 

By 13.1.121 there exist n G N, vi,... ,Vn G g and ti,... ,tn G K>o such that g = 
exp{tiVi).. .expftnVn) and (i)-(iii) of 13.1.121 are satisfied (with m' instead of m). 
Let f := (<i)(ni),..., <l>(u„)) G Xioc(Af)", and T := (ti,...,<„). Then using the 
notation (12.1.11) . 13.1.131 shows that g ■ m' = frirn'), and by definition frim') stays 
in the orbit SAis,,m of to, hence g ■ m' € From this, starting with m' = m 

and then continuing inductively, it follows that any gi ... gi ■ m stays in SA^,m, so 
by (13.1.111 C SA^.m- Since in the above consideration we started out with any 

m' G SA^,m and since the $-orbits are disjoint by definition we have even shown 
that SA^,m is the disjoint union of certain $-orbits. 

To conclude the proof we show that any 4>-orbit that is contained in S'A^.m is in 
fact an open subset of S'a*,™- Indeed, once we know this then due to being 

a disjoint union of such sets it will follow that any $-orbit contained in S'A^.m is 
both open and closed in it, and since iSa#,™ is connected the orbits must in fact 
coincide. 

Thus let to' G S'$,m Q SAij/.m (since we may move points within orbits this is the 
only case we need to consider). Choose Xi = <i)(ni) G IR{G,M) {I < i < n, for a 
suitable n and suitable elements Vi of g) such that Xi,..., A„ is a basis for A$(to'). 
Set f := (Ai,..., Xn). Then by the proof of l2.2.4l (i) and 12.2.51 ('vi'l we have 

rk(roP?,m') = dim(A$(TO')) = n. 

As in (12.2.21) it follows that there exists an open neighborhood [/ of 0 in M" such 
that p{_m'(G) is the domain of a coordinate chart of S'A 4 ,,m- 
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There exists an absolutely convex open neighborhood 1/ C {/ of 0 in R" such that 
for any T = (ti ,... ,tn) & V we have 


exp(tiui).. .exp{tnVn) e Km'- 


In particular, (i)-(iii) of 13.1.121 are satisfied. Since is open in SA^,m it 

suffices to show that p^^m' {V) C Also, because V is absolutely convex we only 

need to show that p^^rn'iT) S for any T = (ti,..., S F fl ]R>q (otherwise 

replace Vi by —Vi). For such a T. 13.1.131 (together with (I2.1.ip ') gives 

(T) = Fl^^ (Fl^= (... Flf„" (m')...)) ^ 2 ) 

= exp(tivi) . exp(t„u„) • m' G S^^rn, 

as desired. The final claim was already shown in the remark following 13.1.111 □ 

If the transformation group G is in fact global then at the moment we have two 
ways of endowing the non-trivial orbits of $ with a differentiable structure: 
the one from 13.1.81 and the one from 13.1.111 The following result shows that these 
approaches in fact coincide. 

3.1.15 Proposition. Let G x M ^ M be a global Lie transformation group 
and let m G M. Then the differentiable structures on any non-trivial orbit 
introduced in \S.l.S\ and in \S.l.lI[ \3.1.14\ coincide. 

Proof. For brevity, we set S := S<i,^rn, with the smooth structure from l3.1.111 and 
we write S' for the smooth structure on defined in 13.1.81 We know that the 
inclusions i : S ^ M and i' : S' ^ M are injective immersions. Since S is an 
initial submanifold of M and i' = io id 5'_>5 is smooth it follows from the universal 
property in 12.2.121 that id : S" —>■ S' is smooth. 

Conversely, to see that id : S —>■ S' is smooth, let p^,m'{V) (with m' G S) be as in 
the proof of 13.1.141 Then p^^rn' '■ V —>■ p^,m'{V) is a diffeomorphism onto an open 
submanifold (containing m') of S, so we only need to show that ids^s' o p^^m' is 
smooth on V. Bv 13.1.81 this is the case if and only if o p^^m' is smooth on V. 
Now since $ is global, it follows from ll.1.131 that for any T G V we have 

= ^'“1(exp(ti'Ui).exp(t„'u„) • m') 

= exp(tiwi).exp(t„u„) • Gm' 

and this map is indeed smooth as the composition of the smooth maps T 1 —>■ 
exp(fiui) exp(t„u„), P —>■ G, and the quotient map tt : G —>■ GfGm'. FI 


3.1.16 Remark. Suppose that G x M if U — >■ Misa semi-regular local 
transformation group. Then since all orbits of $ are immersive submanifolds of the 
same dimension, say k, it follows that the integrable distribution A$ is of constant 
rank fc > 1. If S' is an orbit of $ (i.e., bv 13.1.141 an orbit of then bv 12.2.41 

(ii) S is a maximal integral manifold of A$ and thereby a leaf of A$ (cf. [TTJ Def. 
17.23]). 

Recall from m Def. 17.27] that a flat chart for a distribution A is called regular if 
every leaf of A that intersects it does so in precisely one slice. Moreover, A is called 
regular if every point of M lies in the domain of a regular chart. The following 
result shows that for a regular local transformation group $, A$ is regular in this 
sense. 
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3.1.17 Lemma. Let ^:GxMDU^M be a regular local transformation group. 
Then each point of M lies in the domain of a regular chart for i.e., A$ is a 
regular distribution. 

Proof. Let m € M and pick a cubical chart {U, (p) as in 12.1.11 centered at to, 
p{U) = [—c, c]”. Since G acts regularly on M, there exists some neighborhood 
V C [/ of TO such that the intersection of each orbit S' of G (i.e., each maximal 
integral manifold of A$) with V is connected in S. This intersection is therefore a 
connected integral manifold of A$, hence by 12.1.11 it is contained in a single slice 
Ua of ip. Now pick c' G (0,c) so that U' := CV. If S is any orbit 

of G that intersects U' then it also intersects V, hence S fl 1^ G Ua for some a. 
Therefore, S DU' C [/'. But since S is a maximal integral manifold we also have 
U'a Q S, so altogether we obtain S dU' = U^. □ 


3.1.18 Example. Returning once more to the example from lB.llTl let 

U+ := {(x, y) G I X > 0} 

U- := {(x, y) G I X < 0} 

and set 


: t/± —>■ R X R± 
ix,y) ^ (x, ^ 

We show that {U+,p+) is regular chart for the transformation group $ from [HU] 
(and analogously for {U-,p-)). Note that p~^{r, s) = (r, rs). From l3.1.10l we know 
that A$ is spanned by the vector field X = x'^dx + xydy. In the chart {U+,p+), 
with coordinates (r, s) := p+{x,y) we have 

3t (9 s s 3t (9 s 1 

= —dr + =dr- -9s, Mdy) = ^9, + —9, = -9,. 
dx dx r dy dy r 

Therefore, X has the representation 

^^X = (tp )iV^*(9x) T )i(9^ '}2^^{dy) 

= r'^p^{dx) + rsp.f{dy) = r'^dr. 

By 13.1.41 the non-trivial orbits, i.e., the maximal integral manifolds of $ are half¬ 
rays emanating from (0,0), except for those lying on the j/-axis. Hence each such 
orbit S is either contained in [/+ or in U-. If S' C U+ then 

S = {(x, y) G I s = — = c and x > 0}, 

X 

so S is a slice of p+. This shows that {U±^p±) are regular charts for A$. Never¬ 
theless, $ is not regular because the points on the y-axis are trivial orbits, i.e., not 
every orbit of $ has the same dimension. Removing the y-axis from M = we 
obtain a regular local transformation group. 


Finally, we have the following fundamental result on the space of orbits of a regular 
local transformation group: 


3.1.19 Theorem. Let GxMdU^M be a regular local transformation group 
on an n-dimensional manifold M with k-dimensional orbits. Then the set M/G of 
orbits o/$ can be endowed with the structure of an {n — k)-dimensional manifold 
with the following properties: 


38 














(i) The quotient map tt : M ^ M/G, m is a surjective submersion. 

(a) m and m' belong to the same orbit if and only if Tr{m) = 7r(m'). 

(Hi) For any m G M, A^{m) = kerT^Tr. 

Proof. Surjectivity of tt and (ii) are immediate from the definition. Since the 
orbits of $ are precisely the leaves of the foliation induced by A$, which is regular 
bv 13.1.171 (i) is [TTJ 17.29]. For (iii), note that since n is constant on any orbit S 
of must vanish on the tangent space of S' at m G S'. But S is an integral 

manifold of A$, so T^S = A$(to). Thus A$(to) C keT{Tm'!T). Finally, since tt is a 
submersion, dimker(rm7’‘) = k = dim(A$(TO)), so we have equality. □ 


3.1.20 Remark. For later use we recall the construction of CH 17.29]: Given a 
regular chart {U,ip = {x^,.. x")), let U' := 7r([/). Then the map tp' : U' ^ 
m' I—>■ pr 2 (:^(m)), where m is any element of 7r“^(m') n U (recall that fl 

U = Ua for some a and pr 2 o = a) is a typical chart for M/G. The local 
representation of tt with respect to the standard charts then is 

(p'oTT 0(^-1 =pr2 = (x\...,x") (x'=+\ ..., x") : 

MAC/ —^ U' C M/G 

F 

t{U) t'{U) 

3.2 Symmetries of algebraic equations 

From this point of the course onwards we will closely follow Giver’s work [14]. Con¬ 
cerning notations, we will henceforth typically denote points in a smooth manifold 
M by X, y, ■ ■ ■, since the manifolds we are interested in will mainly be subsets of 
spaces of independent and dependent variables of differential equations. As already 
announced before 11.1.131 we will from now on denote the group parameter with e 
instead of t since we will often need f as a variable in a differential equation. More¬ 
over, given a local Lie group action $ on M we will often notationally suppress the 
Lie algebra homomorphism v i—>■ $(u) from (ll.l.2|) . i.e., we will often simply write 
V instead of $(u) G Xioc(Af). Also, we will usually only write G instead of 4). 

By a system of algebraic equations we mean any system of equations 

F,(x)=0, u=l,...,l, (3.2.1) 

where are smooth real-valued functions on M. The term ‘algebraic’ 

is used to distinguish this situation from the case of differential equations to be 
considered later on. It does not, however, restrict the form of the Fi, (e.g., to 
polynomials). A solution of (13.2.11) is any point x G M such that F,j{x) = 0 for 
y = 1,... ,1. A symmetry group of p.2.ip is any local transformation group G on 
M that transforms any solution of (13.2.111 into another solution. 

More generally, we define: 

3.2.1 Definition. Let G be a local transformation group on a smooth manifold M. 
A subset S C M is called G-invariant, and G is called a symmetry group of S if, 
whenever x G S and g G G is such that g ■ x is defined, then also g ■ x G S. 
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3.2.2 Example, (i) Let M = and consider the one-parameter group of trans¬ 
lations 

Gc : (x, y) (x -I- cs, y + e) (c G M). 

Then any line {x = cy + d} C is an orbit of Gc, hence is invariant under Gc- 

Any Gc-invariant subset of M is a union of such orbits. 

(ii) Again let M = R^ and for a G R let 

G“ : (x, y) !-)> (Ax, A“y) (A > 0). 

Then {(0, 0)}, as well as the positive and negative x- and y-axes are G“-invariant. 
Hence also the entire axes are invariant, being unions of invariant sets. Moreover, 
{xy = 0 } and {y = fc|x|“} (for x < 0 or x > 0 ) are invariant. 

(iii) Most of the time we will be interested in invariant sets S that are subvarieties, 
given by the common zero set of smooth functions F = (iA,..., T]): 

S = Sf = {x I Fy{x) =0, V = 1,... ,V\. 

(iv) If Si, S 2 are invariant subsets, so are Si U S 2 and fl 52 . 

3.2.3 Definition. Let G be a local transformation group acting on M and let 
F : M ^ N be a map into a manifold N. F is called G-invariant if for all x G M 
and all g G G such that g ■ x is defined we have 

F{g ■ x) = F(x). 

If N = M. then F is simply called an invariant of G. 

Obviously, F = (Fi,..., Fj) : M —>■ RMs G-invariant if and only if each F^, is an 
invariant of G. 

3.2.4 Example, (i) For Gc the group of translations from 13.2!^ lil. the function 

/(x,y) :=x-cy 

is an invariant: f(x + C£,y + s) = f{x,y). Moreover, any invariant of Gc must be 
of the form y(x — cy) for some smooth g. 

(ii) As in 13.2.21 fiil. let G^ : (x,y) 1 —>■ (Ax, Ay) (A > 0). Then f{x,y) := x/y is an 
invariant for G^, defined on {y 7 ^ 0}. Another invariant is (x,y) 1 —>■ xyjix^ F y^), 
defined on R^ \ {(0, 0)}. There is no smooth nonconstant invariant defined on all of 
R 2 . 


3.2.5 Remark. If F : M —>■ RMs a G-invariant function then every level set 
of F is a G-invariant subset of M: if Fix) = c and y • x is defined then also 
F(y • x) = F(x) = c. 

However, if the zero-set {F(x) = 0} of a smooth map F is G-invariant then F itself 
need not be invariant. For example, {(x,y) | xy = 0} is invariant under G^ from 
(ii), but F(x,y) = xy is not G-invariant since F(Ax, Ay) = A^xy 7 ^ F(x,y). 
To obtain a true statement we need to take all level sets into consideration: 

3.2.6 Lemma. Let G be a local transformation group acting on M and let F G 
C°“(M,R^). The following are equivalent: 

(i) F is invariant under G. 
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(ii) Every level set {F{x) = c} (c £ of F is invariant under G. 


Proof. (i)=J>(ii): See 13.2.51 

(ii)=>(i): Let x, g be such that g ■ x is defined and set c := F{x). Then also 
gx G {y \ F{y) = c}, so F{gx) = F{x). □ 

The following result gives a simple, linear, criterion for a function to be an invariant 
of a group action. It is a first typical example of Lie theoretic methods in symmetry 
analysis. Recall from 13.1.51 that we always assume transformation groups to be 
connected. 


3.2.7 Theorem. Let G be a local transformation group acting on M and let f £ 
C°“(M, R). The following are equivalent: 

(i) f is an invariant ofG. 

(ii) For every infinitesimal generator v of G we have v{f) = 0. 


Proof. (i)^(ii): For clarity, we de-identify again and write ^{g,x) = gx for the 
action of G. Let u £ g. Then since / is an invariant, given any x £ M, bv 13.1.91 we 
obtain for e small: 

fix) = fiexp{ev) ■ x) = /($(exp(eu),a;)) = f{Fif^'’\x)) = /(Fl^(a;)). 
Differentiating this expression with respect to e at £ = 0 we get 


v{f) = $(u)(/) = 0. 


(ii)=>(i): Since n(/) vanishes identically on M we have 

^/(exp(£u) • x) = ^/(Fl^(x)) = ?;(/)(Fl"(x)) = 0, 

wherever defined, so f{exp(ev) ■ x) = f{x) wherever defined. Since G is connected, 
by 13.1.121 any g € Ux can be written as a product of certain exp(eiVi) (vi £ g) 
satisfying (i)-(iii) from that result. Then 13.1.131 shows that we can iterate the 
above argument to conclude that f{g ■ x) = fix) for a\\ g &Ux- □ 

It follows that, if {?;i,..., z;^} is a basis of the local Lie algebra of infinitesimal 
generators of G (i.e., if {$(wi),..., ^(ur)} is a local basis of A$) then / is an 
invariant of G (on the open set where {z;i,..., Vr} is a basis) if and only if Vkif) = 0 
for fc = 1,..., r. If G acts effectively, then by 11.1.171 we may take for {?;i,..., Vr} 
any basis of g. Writing Vk = X]r=i local coordinates, this means that / has 

to satisfy the homogeneous system of linear PDFs of first order 


Vkif)ix) ='^Ckix) 

i=l 


K 

dxi 


0 (fc = l,. 


r). 


(3.2.2) 


3.2.8 Example. We return to the translation erroup G^ from l3.2.2l ('iL Its infinites¬ 
imal generator is 


V = 


fL 

de 


[x F ce,y F e) 

0 


(c, 1) = cdx + dy. 


We already know that f{x, y) = x — cy is an invariant, and indeed n(/) = 0. 

In the case of the scale group G“ from 13.2.21 (ii) we have an action of the multi¬ 
plicative group (R'*',-) on R^ whose generator therefore is v = 9aU=i(Ax, A“j/) = 
xdx + aydy and it is easily checked that the infinitesimal criterion is satisfied for 
the invariants of G“ given in 13.2.41 (ii). 
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Turning now to symmetries of systems of algebraic equations, we begin by deriving 
a general criterion for the local invariance of submanifolds under local group actions. 


3.2.9 Definition. Let G be a local transformation group acting on M. A subset 
S Q M is called locally G-invariant if for every x € S there exists a neighborhood 
Ux L- lAx (cf. \S.1.1\) of the identity in G sueh that g ■ x € S for all g £ lAx■ A smooth 
map F : U ^ N (U open in M) is called locally G-invariant if for each x € U there 
exists some neighborhood Ux QUx of e in G such that F{g-x) = F{x) for all g €Ux 
with g ■ X G U. F is called globally G-invariant if F{g ■ x) = F{x) for all x G U and 
g G G such that g ■ x G U. 

3.2.10 Example. Let G be the group of translations {x, y) i—>■ (x+e, y) on M = 
Then the set S := {(x, 0) | —1 < x < 1} is locally G-invariant, but not G-invariant. 
Let 

, '!_/ 0 y<0orj/>0 and x > 0 

f[x,y) I y > 0 and X < 0. 

Then / is smooth and locally G-invariant on U := \ {(0,y) | y > 0}: in fact, 

/(x-|-£,y) = f{x,y) for jej < |x|. But / is not globally G-invariant. 

3.2.11 Theorem. Let N be an initial submanifold of M. The following are equiv¬ 
alent: 

(i) N is locally G-invariant. 

(ii) The infinitesimal generators of G are everywhere tangent to N, i.e.: 

yx G N yv G q: v\x G TxN 

Proof. If$: GxM^U^M is the action of G then the above condition, written 
out, means (cf. (I1.1.2|) b 


Vx G TV Vx G g: ‘I>(x)|a; G TxN 


(i)=>(ii): Let v G Q, x G N. Since N is locally G-invariant, for |e| small we have 
<i)(exp(eu),x) G N. As N is initial, e i—>■ $(exp(ex),x) is smooth as a map into N 
as well. The derivative of this curve at e = 0 therefore is an element of TxN. This 
implies (ii) since bv 13.1.91 we have 


$(u)(x) 


A. 

de 


<i)(exp(£u), x). 

0 


(ii)=>(i): By assumption, any $(r)) is tangent to N, hence by m 17.16] it can be 
viewed as a vector field on N, and so the restriction of its flow to TV is a local 
diffeomorphism on N (cf. [TTl 7.2]). Given x G TV, Flf^’'^(x) exists for |£| small. 
Using [TT^ we therefore have for |£| small 


exp(£x) • X = <i)(exp(£u), x) = Flf^^^(x) G TV. (3.2.3) 


Next, choose a basis vi,... ,Vk of g. Then as in the last part of the proof of 12. 2. 31 it 
follows that there exists some c > 0 such that o • • • o FlJj_^’'''^(x) exists for all 

|£i| < c. In addition, we may assume c so small that exp(£ixi).exp(£fcUfc) G Ux 

for \ei\ < c. Then (i)-(iii) of l3.1.12l hold and so, bv 13.1.131 iterating (13.2.31) gives 

exp(£ixi).exp(£feUfc) • x = Fl^^^”^^ o • • • o Fl^^^”'=^(x) G TV 
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for these e^. Therefore, defining the neighborhood lAx '■= {exp(eini).. .exp(£/cnfe) | 
\ei\ < c} of e G G (shrinking c if necessary, cf. [11] 8.5]) we obtain that g ■ x £ N 
for aW g GUx- n 


3.2.12 Remark. As the proof of 13.2.111 shows, the implication (ii)=^>(i) remains 
correct even for N an immersive submanifold of M. However, (i)=>(ii) is not true 
in this generality: To see this, equip M = with a new manifold structure N 
whose charts are (pa ■ (x,a} >->■ x (a £ R), see m Ex. 14.3]. Then is a one¬ 
dimensional immersive submanifold of M with underlying set K^, hence is (even 
globally) invariant under any local group action on M. But clearly not every group 
action has generators tangential to N (i.e., horizontal). 

As the most important special case of l3.2.111 we consider zero sets of smooth maps: 


3.2.13 Corollary. Let G be a local transformation group acting on M, dim(M) = 
TO. Let F ■. M ^R} (I < m) be smooth and of maximal rank (= 1) at every solution 
X of the system F^{x) = 0 (1 < v < 1). Then G is a symmetry group of this system 
if and only if 

Vn £ 0 Viz = 1,... 1 Va; G M with F{x) = 0 : v{Fv)\x = 0 (3.2.4) 

Proof. By [TUI 1.1.23], A^ := {a; G M j F{x) = 0} is a regular (hence in particular 
initial) submanifold of M. Thus bv 13.2.111 N is locally invariant under G if and 
only if v\x £ TxN for all a: G A^ and n G g. By (TUI 1.1.25], 

i 

TxN = keriTxF) = f| ker(r,E,) 

for all a: G N. This implies that N is locally G-invariant if and only if for each 
x £ N, i.e., for each x £ M with F{x) = 0, each a; G g, and each v = 1,... ,l we 
have that 

0 = TxF^{v\x) = v{F^)\x. 

It remains to show that local invariance implies invariance of N under G. Thus 

let X G A^ and g £ lAx- Bv 13.1.1^ g = exp(<iz;i).exp(t„u„) with properties 

(i)-(iii) from that result. To show that g-x £ N,we proceed by induction. Suppose 

we already know that gi ■ x £ N, where gt := exp(ti^iVi+i) .exp(t„z;„) and 

set Ai := {t £ [0,ti] ] (exp(toi) ■ gf) ■ x G A^}. Then 0 G and Ai is closed 
since N is. If t G then by local invariance of N there exists some a > 0 
such that for all jsj < a we have exp(s?;i) • [(exp(toi) • gi) • x] G A^. Here, hi := 
exp(txi) ■ gi £ Ux and exp(sxi) G Uhi-x- Also, for jsj < a and t -I- s G [0,ti], 
exp(sxi) • exp(txi) • gi = exp((s + t)vi) ■ gi £Ux bv 13.1.121 liiil. Hence IXLTI lil gives 
[(exp((s-|-t)xi)-gi)]-x = [exp(sxi)-(exp(tvi)- 5 i)]-x = exp(sxi)-[(exp(txi)-gi)-x] G N 
for these s, which establishes that Ai is also open in Altogether, Ai = 

and our claim follows. □ 


3.2.14 Example, (i) Let G 
with infinitesimal generator 


= 50(2) be the rotation group acting on M 


— y sin e, x sin e + y cos e) = —ydx + xdy 


43 

















Then = {x^ + = 1} is clearly invariant under G. We can also see this using 

Id. 2. 131 let F{x, y) := x"^ + y^ — 1. Then F is of maximal rank (= 1) on its zero set 
and v{F) = —2xy + 2xy = 0. 

(ii) Let H{x, y) = y^ — 2y+ 1. Then the zero set of F[ is the horizontal line {y = 1}, 
which is manifestly not invariant under G. Nevertheless, v{H) = 2xy — 2x = 
2x{y — 1) vanishes on this zero set. However, also TFl = (0, 2?/ — 2) vanishes there, 
so 13.2.131 does not apply. This demonstrates that the maximal rank condition in 
13.2.131 cannot be dropped. 

The following criterion will turn out to be useful many times later on: 

3.2.15 Theorem. (Hadamard’s Lemma) Let F : M"* M) (I < m) he of maximal 
rank on the subvariety Sp ■= {x € M \ F{x) = 0}. Then a smooth real-valued 
function /: M —>■ M vanishes on Sp if and only if there exist Qi, ■ ■ ■ ,Qi G C°°(M,]R) 
such that 

Wx G M : f{x) = Qi{x)Fi{x) H-h Qi{x)Fi{x). 

Proof. The condition is clearly sufficient. Conversely, suppose that / vanishes on 
Sp and let x G Sp. Since F is a submersion, by [101 1-1.7] there exists a chart (tp = 
(?/\ ... , 2 /™), Lf) centered at x such that F^[y) := Foip~^{y^, ... , 2 /”") = ( 2 /\ ... ,2/0- 
It follows that f^ := fo(p~^ vanishes on <f{U) fl ({0} x We can suppose that 

y:>(U) is a ball around 0 in R™. Then for y = {y', y") := (y^,..., y\ y ^^^,..., 2/™) 
we have 

Uiy',y'') = fpiy',y'') - Uio,y") = [ dtf^i,ty',y'')dt 

Jo 

Therefore, setting Q\j := Q), o ip G C°°(U) we obtain f\u = Qu ' ^i\u- 

On the other hand, if a; ^ 5^ then there exists some open neighborhood U oi x and 
some i G {1,..., /} such that Fi{y) ^ 0 for all y G U. Then we set Q\j := f /Fi 
and Q)j := 0 for j ^ f to again obtain f\u = Yl\=i Qjj ' Pi\u- Now pick a locally 
finite open cover (Ua)a^A of neighborhoods as above and a subordinate partition 
of unity (xa)a^A with supp(xQ,) C Uq, for all a. Then 

i i 

f = Y1 = =: 

a Oi i—1 i—1 

□ 

It follows that an equivalent reformulation of (13.2.41) is that for any v G Q there 
exist smooth functions Q^fj, : M ^ R {y, p = 1,..., 1) such that 

i 

v{F^){x) ='^Q^fj,{x)F^{x) v = l,...,l, X G M. (3.2.5) 

M=1 


3.2.16 Example. The functions Qi, in 13.2.151 are in general not unique: Let 
M = and F{x,y,z) := {x,y). Then the function f{x,y,z) = xz y'^ vanishes 
on Sp (which is the z-axis) and we have 

f = zFi+ yF 2 = (z- y)Fi + (a: + y)F 2 . 
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If / = '^vQvFu = then the differences := Qu — Qu satisfy the 

homogeneous system 

i 

'^R,{x)F,{x) = 0. (3.2.6) 

i/=i 

For such functions we have: 

3.2.17 Proposition. Let F : —>■ R* be of maximal rank on Sp = {x G 

M I F{x) = 0}. If Ri,...,Ri € R.) satisfy (13.2.61) . then each Ry vanishes 

identically on Sp- Equivalently, there exist Sf; G fy,/i = l,...,l) such 

that, for all X G M and all u = 1,... ,l 

i 

Ry = Y. (3.2.7) 

In addition, the Sf; can he chosen to be anti-symmetric: = —Sf. In this case, 

(13.2.71) is necessary and sufficient for (I3.2.6|) . 

Proof. As in the proof of l3.2.151 by using a partition of unity and suitable charts 
we may reduce the proof to neighborhoods of points with either xGSpoix^Sp. 

1) If X G Sp, using a chart as in l3.2.151 we may suppose that M is a ball in R™ and 
F = (x^,..., X™) !->■ (x^,..., xfy Then (13.2.61) reads 

i 

'^Ry{x)x''=0. (3.2.8) 

We show by induction that (13.2.81) implies the existence of a skew-symmetric matrix 
of smooth functions satisfying (13.2.71) . 

For I = 1, (13.2.81) reduces to Ri{x)x^ = 0 for all x, so by continuity Ri{x) = 0 for 
all X, and we can choose = 0. 

I — \ ^ h Setting x' := (x*+^,..., x™), (13.2.81) implies that i?;(0,..., 0, x*, x') • 
X* = 0, so in fact i?;(0,..., 0, x^, x') = 0 for all (x*,x'). Applying 13.2.151 to F := 
(x^,..., x"*) !->■ (x^,..., x*“^), it follows that there exist smooth functions {p. = 
1,..., 1 — 1) of X with Ri{x) = Ff{x)x^. By (13.2.81) . therefore, 

i-i i-i i-i 

0 = ^(i?^(x)x'^ + Si{x)x^x^) = ^(i?^(x) + Si{x)x^)x^ =: ^ R^x^■ 

^—1 /i = l ^—1 

By our induction hypothesis there exist smooth functions Sf; {I < yi,v < l — \) such 
that Ry = Ef:x^ ior 1 < V <l — 1 and such that Sf; is skew-symmetric. By 

definition, R^ = R^ — Sf x’' ioi 1 < yi < I — 1. Therefore, 


/fil \ 

Ri-1 


/ s\ . 

SU • 

■ s[-^ -s[ \ 

qI — 1 qI 

‘^Z-1 


/x^\ 

x'-i 

V Ri y 


1 SI . 

. sl-^ 0 y 


^ / 


giving the desired skew-symmetric I x I matrix S. 

2) Now suppose that x ^ Sp. Again we proceed by induction. For I = 1, Pi(x) ^ 0 
and Ri ■ Fi =0 imply that we can write i?i(x) = 0 = 0 • Fi =: S^ ■ Fi on a 
neighborhood of x where Fi ^ 0. 
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Next, assume that the result is already proved for I — 1. Since F{x) ^ 0, some 
component of F(x) must be non-zero, and without loss of generality we may suppose 
that Fi-i(x) ^ 0 (the proof below works the same way for any other component as 
well). Then on a neighborhood U oi x where Ti_i(x) ^ 0 we have 

+ ■ ■ ■ + R1-2F1-2 + (^Ri-i + = 0 , 

and so by our induction assumption (and shrinking U if necessary) there exists a 
skew-symmetric {I — 1) x {I — 1) matrix Sj^ such that 


/ Ri \ 


/ . 

■ s[-^ \ 


( F,\ 

Ri-2 

= 

SI 2 ■ 

qI — 1 
^1-2 


Fi-2 

\Ri-i+Ri^J 


\sl_, . 

■ slzl) 


\ Pi / 


Consequently, 


f Ri\ 


/ s\ ... 5^-1 0 \ 


/ F, \ 

Ri-2 

= 

SI 2 ... S\zl 0 


Fi-2 

Ri-i 


Sl_, ... Rizl -Ri/R-i 


Fi-i 

\ Ri y 


0 

0 

1 

0 

\ Pi ) 


which gives the claim also in this case. 

Conversely, if (13.2.71) is satisfied, then denoting by (, ) the standard scalar product 
on K* we have 

{R(x),F(x)} = {S(x)F(x),F(x)) = 0 

since = —S. This is (13.2.611 . □ 


3.3 Invariance and functional dependence 

Our goal in this section is to determine ‘how many’ invariants a given group action 
possesses. We first observe that if are invariants (either local or global) 

of a group G and F = F(z^, ..., z^) is any smooth function then also 

C(x) :=F(C'(x),...,C"(x)) 

is an invariant. However, ( is completely determined by and so adds no 

additional information. We therefore want to determine invariants up to this kind 
of relation. 

3.3.1 Definition. Tet (C^,..., ^^ G C°°(M, K). Then 

(i) ... ,^^ are called functionally dependent if for each x G M there exists a 

neighborhood U of x and a smooth map F : R.* —>■ R that does not vanish 
identically on any open subset o/R^ such that 

T’(C'(x),...,C"(x)) =0 (3.3.1) 


for all X G U . 
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(ii) are called functionally independent if they are not functionally de¬ 

pendent when restricted to any open subset U of M. Equivalently, if F 
as in (i) is such that (13.3.111 holds for all x in some open subset U of M 
then F{zi,..., Zk) = 0 for all z in some open subset of Mf (which contains 
(C\...,C'')(t^') for some U' QU). 

3.3.2 Example, (i) The functions x/y and xy /are functionally dependent 
on {{x,y) I y ^ 0} because 


xy 


^Iv 


2-2 _|_ 2^2 _|_ (xjyf 


= /(a;/y)- 


On the other hand, xjy and x-\-y are functionally independent where defined since if 
F{x + y,x/y) = 0 for (x,y) in any open subset of then since {x,y) i—>■ {x-\-y,x/y) 
is a local diffeomorphism, F has to vanish identically on some open subset of the 
image of this set. 

(ii) The functions 


T]{x,y)=x, C{x,y) = 


X y < 0 

X + y > 0 


are functionally dependent on {y < 0}, independent on {y > 0}, but neither on the 
entire space R^. 

For a characterization of functional (in)dependence we need two auxiliary results, 
both of which are of independent interest as well. 

3.3.3 Proposition. Let M be a second countable smooth manifold and let S be 
a closed subset of M. Then there exists a smooth function / : M —>■ R such that 
S = {x€M\ /(x) =0}. 

Proof. Since M \ S' is open and M is second countable, M \ S can be written as 
a locally finite union of countably many sets Vj {j G N) and there exist functions 
Xj S C°°{M) with Xj ^ 0 and Xj > 0 precisely on Vj (see the proof of O 2.3.10]). 
Then set 

/ -Hxj - 


Since {Vj)j is locally finite, locally around any point in M only finitely many sum¬ 
mands are nonzero, so f G C^{M). Moreover, for any x G M we have 

f{x) = 0 O Vj : Xj{x) = 0^x G Pi (M \ V,) = M \ IJ Ij = S. 

jeN iGN 

□ 

In the next theorem, by a critical point of a smooth map f: M ^ N we mean a 
point X G M where T^f is not surjective. A point y G N is called a critical value of 
/ if there exists a critical point x G M oi f such that y = /(x). 

3.3.4 Theorem. (Sard’s Theorem) Let f: M ^ N be a smooth map between 
smooth manifolds. Then the set of critical values of f in N has measure zero, in 
the sense that in any chart of N the image of the set of critical values in that chart 
domain has Lebesgue measure zero. 

A proof of this result would take us too far afield, so we refer to [i 3.2]. 

Based on the previous results we now have: 
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3.3.5 Theorem. Let ( = ..., be a smooth map from M toMf. Then the 

following are equivalent: 

(i) are functionally dependent on M. 

(a) Vx € M : rk(Ta;C) < k. 

Proof. (i)=^>(ii): Suppose that for some x G M we had rk(T 2 :^) = k. Since the rank 
can’t fall locally, there would then exist some open set U around x with rk(T 3 ,/C) = k 
for all x' G U. We may assume U small enough that there is some F as in 13.3.11 
with F{Cf{x'),... = 0 on U. Since C is a submersion on U, it is an open 

map. Thus F vanishes on the open set C{U), a contradiction. 

(ii)=^(i): Let U be open and relatively compact in M. Bv 13.3.31 there exists some 
F G such that F{z) = 0 if and only if 2 S ((U). Since CiU) C_[C(x) | 

rk(T 2 ,C) < k} and the latter set has measure zero by Sard’s theorem [3.3.4l f{U) does 
not contain any non-empty open set. Thus ..., are functionally dependent. 

□ 


3.3.6 Theorem. Let G act semi-regularly on the m-dimensional manifold M with 
k-dimensional orbits and let xq G M. Then there exists some open neighborhood U 
of Xq such that there are precisely m — k functionally independent local invariants 

..., of G on U. Any other invariant of G on U is of the form 

/(x) = ^’(ci(x),...,r-'=(x)) 

for some smooth function F. Lf the action of G is regular then the invariants can 
be taken to be globally invariant on a neighborhood of Xq. 

Proof. By 13.1.161 any orbit of G is a leaf of the /c-dimensional integrable dis¬ 
tribution A$. Thus the classical Frobenius theorem 12.1.11 yields a cubic chart 
{(p = (x^,..., X™), G) centered at Xq, p{U) = (—c, c)™, such that any orbit of 
G that intersects U does so in a (by 12.2.141 at most countable) union of slices 
Ua = X {a}). For 1 < i < m - fc, let C : G ^ R, C ■= x'^+^{= </?'=+*). 

Then by definition, each C* is constant on each slice, hence is locally constant on 
any orbit that intersects Lf, hence is a local invariant. Any other local invariant 
/ must be constant on each slice. In terms of the chart p this means that / does 
in fact not depend on the variables x^,... ,x^, and therefore is a function of the 
remaining variables (1 < * < fc) only, i.e., of the Q. 

Finally, if the action of G is regular, then bv 13.1.171 the chart {p, Lf) can be chosen 
such that each G-orbit intersects Lf in at most one slice. Then the C* are constant 
on this slice, i.e., on 17 fl 5, so they are global invariants. □ 

Since C = (C^i ■ • ■) maximal rank, the C* are functionally independent by 

13.3.51 Such a family of invariants (i.e., a functionally independent family such that 
any other invariant is a function of the members of the family) is called a complete 
set of functionally independent invariants. 

3.3.7 Proposition. Let G act semi-regularly on the m-dimensional manifold M 
with k-dimensional orbits. Then: 

(i) Any complete set of functionally independent local invariants of G has m — k 
elements. 

(ii) If nf,... is a set of functionally independent local invariants, then lo¬ 

cally around any point in its domain it is complete. 
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Proof, (i) Let r] = (ry^, ... ^rf) and (0^,..., 9^) be two sets of functionally indepen¬ 
dent invariants of G and assume that I > p. Since 9 is complete, there exists some 
smooth map F : il ^ W' with 17 C Rp open such that i] = F o 9. But then for any 
X, rk(Ta;r 7 ) < rkTe( 2 ;)(F) < p < I, a contradiction to 13.3.51 By symmetry, I = p, and 
by 13.3.61 it follows that in fact I = m — k. 

(ii) Suppose that / is some invariant defined on the domain 17 of ry = ..., 

and let x S 17. Also, pick C = (C^) • ■ •) around x as in l3.3.6l Then there exists 

some F : —>• R"*”^, such that rj = F o(^ near x. Since rk(ry) = rk(C) = m — k, 

we also must have rk(J^) = m — k, i.e., F is a local diffeomorphism. Since / is a 
smooth function of ( near x it is therefore also a smooth function of ?y. □ 


3.3.8 Proposition. Let G act semi-regularly on the m-dimensional manifold M 
with k-dimensional orbits and let be a complete set of functionally 

independent invariants defined on an open subset U of M. If a subvariety Sp = 
{x € M \ F(x) = 0} (with F G C°°(M, R*)j is G-invariant, then for each solution 
xo G Sp L\ U there is a neighborhood U Q U of xq and a smooth function f such 
that the solution set of the corresponding invariant F = x /(^^(x),..., Cf^~^{x)) 
on U coincides with that of F, i.e., 

Spnu = Spnu = {xGU\ /(c'(x),..., r~Hx)) = o}- 


Proof. Since (by 13.3.511 the rank of ..., is m — k on U we may find 

coordinates (?/^,..., y/™) on a neighborhood {/ of xq such that y/* = Q for 1 < 
i < m — k. In fact, the remaining k coordinates can be determined by picking 
suitable x*^ (called parametric variables) from any given coordinates (x^,... ,x'"). 
The change of coordinates is then of the form yy = y/;(x) = (C(x),x), with x the 
parametric variables. 


We show that these coordinates are then flat for A$: Each C-l is constant on the 
orbits of A$, hence ii vi,... ,Vk G g are such that A$(x) = span(ui|a;,...,Xfc|x) 
then 




dt 


C^(Fir(x)) = 0 

0 


for all X £ U. Therefore, Vi\x G nj=i ker(T'a;(^7) = Xx((, and since dim(ker(T'a;^)) = 

k, A<i,(x) = span(ui|a ,,... ,Vk\x) = kerT^,!^. On the other hand, ker(Ta;(C) is spanned 
by dym-k+i,... ,dym because TCf{dyj) = = 0 for i G {!,...,m — fc} and 

j G {m — fc -I- 1,. .., m}. 

In terms of the new variables we can write Ffx) = F*{y) = F*{f{x),x), where 
F* := F o Denoting by xq the value of the parametric variables at Xq we now 
set f(z) := F*{z,Xo), and 


Fix) := fiCix)) = fiCix),..., C'^ix)). 


By assumption, Sp is G-invariant. Also, the orbits of G intersect U in the slices 
{((x) = c}. Therefore, F(x) = F*(^(x), x) = 0 if and only if F(x) = F*(^(x), Xq) = 
0 since {({x),x) and (((x),Xo) lie in the same slice. □ 

We now turn to the problem of actually calculating invariants of a local transforma¬ 
tion group. To begin with, we consider the case of a one-parameter group G with 
infinitesimal generator v G Xioc(Ff)(M). In terms of local coordinates x^,... ,x"' 
we can write 

V = ^^(x)dxi H-l-^"‘(x)dxm 
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for certain smooth functions Then bv l3.2.71 a local invariant / of G is a solution 
of the linear homogeneous first order PDE 

»(/) = «'(*)^ + ■ ■ ■ + 

Bv l3.3.6l we know that if v\x ^ 0 then locally around x there exist m— 1 functionally 
independent invariants, i.e., m — 1 functionally independent solutions of (13.3.21) . In 
PDE theory, the method of choice for solving (13.3.21) is the method of characteristics., 
cf. [H Sec. 3.2]. This basically consists in solving the corresponding system of ODEs 
given by 

-^=C{x{t)) (3.3.3) 

Solutions of (13.3.21) are then functions that are constant along solutions of (I3.3.3p . 
Such functions are also called first integrals of (I3.3.3P . In this terminology, I3.3.6l savs 
that (|3.3.3I) possesses m — 1 functionally independent first integrals locally around 
any point. 

3.3.9 Remark. Using the straightening-out theorem (cf. [TT] 17.14]) it is very 
easy to see all of the above directly. In fact, in an appropriate coordinate system 

,..., we have v = dyi near the point x. It is then immediate that the functions 
y'^,, y™ form a complete set of functionally independent invariants of EU. Eur- 
thermore, in these coordinates (|3.3.3I) reads ^ = 1, and ^ = 0 for i = 2,..., m, 
with general solution y^{t) = t + ci, y^ = Ci tor i = 2,... ,m. A complete set of 
functionally independent first integrals then of course is also given by ... ,y™, 
because these functions are constant along this general solution of the ODE. 

3.3.10 Example. Consider the rotation group SO{2) on \ {(0,0)} with in¬ 
finitesimal generator v = —ydx + xdy. Then the characteristic system reads 

dx dy 

— = —y — = X. 
dt dt 

A first integral is f{x, y) = x"^ + y^, since 

jJ{x(t),y{t)) = dxfix(t),y{t)){-y{t)) + dyfix{t),y{t))ix{t)) 

= —2x(t)y(t) + 2x(t)y{t) = 0. 

Any other first integral, i.e., any other invariant of v, is a function of /. 

In the remainder of this section we want to explore how invariance of functions 
or subvarieties under a local transformation group can be expressed in terms of 
quotient manifolds, using [5T. 191 The main result is as follows: 

3.3.11 Theorem. Let G be a local transformation group on M that acts regularly 
on the m-dimensional manifold M with k-dimensional orbits. 

(i) A smooth map F : M ^ is G-invariant if and only if there exists a smooth 
function F : M/G —>■ K.* such that F{x) = F{tt{x)) for all x € M. 

(a) Let F G C°°(M,R*). Then the corresponding subvariety Sp = {x G M \ 
F(x) = 0} is G-invariant if and only if there exists a smooth map F : M/G ^ 
R* such that, for all x G M, F{x) = 0 if and only if F{tt{x)) = 0. 

(Hi) An n-dimensional regular submanifold N of M is G-invariant if and only if 
there exists a smooth (n — k)-dimensional regular submanifold N = N/G of 
M/G such that N = (and therefore N = tt(N)). 
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Proof, (i) By definition, tt is invariant under G, so any F o n is G-invariant as 
well. Conversely, suppose that F G K.*) is G-invariant. Using a regular chart 

{U,(p), the proof of 13.3.61 together with 13.1.201 shows that on Tr{U) there exists a 
smooth map Fjj such that Fjj o tt = F on U. The sets tt{U) form a covering of 
M/G by chart neighborhoods (since tt{U) = U'), so we may find a partition of 
unity (xj) on M/G subordinate to it, say with suppxj g U/ for all j. Then setting 
F := Xj • Fuj it follows that, for any x G M, 

F{7r{x)) = ^xj(7r(x)) • Fu^{tt{x)) = '^Xj{'^{x)) ■ F{x) = F{x). 
j j 

(ii) Suppose first that there exists some F G C°°{M/G,W) such that F{x) = 0 if 
and only if F(7t(x)) = 0. Then if y is in the same orbit as x, tt{x) = 7r(j/). Thus, if 
F{x) =0 then 0 = F{tt{x)) = F(7r(y)), which in turn is equivalent to F{y) = 0. It 
follows that 5^ is G-invariant. 

Conversely, if is G-invariant then bv 13.3.81 (together with l3.1.'2ni) it follows that 
any point in M has a neighborhood U such that there exists some smooth map 
Fjj on U' = tt{U) with the property that, for all x G U, F{x) = 0 if and only if 
Fu{tt{x)) = 0. Let Fu{z) = {Fi{z),... ,Fi{z)). Then we may replace Fj by (i^i)^ 
for all i G {!,...,/} while retaining the property stated above. Hence without 
loss of generality we may assume that each Fj is non-negative. As in (i), pick a 
partition of unity (xj) on M/G subordinate to a covering by such neighborhoods 
U'j (j G N) and set F := J2jXj ' Fuj- Let x G M with F{x) = 0 and suppose 
that 7r(x) G U'j = T^(Uj). Then for some Xj G Uj we have Tr{x) = 7r(xj), so since 
Sf is G-invariant we obtain Xj G Sp as well. Hence 0 = Fij.{7r{xj)) = Fu.{tt{x)) 
for any such j, implying F{tt{x)) = 0. Conversely, suppose that F{7r(x)) = 0 and 
pick k such that Xfc(7r(a;)) > 0. Then since each F/j^ is non-negative, it follows that 
Fu^(tt(x)) = 0. Since U/ = Tr(Uk), there exists some Xk G Uk with TT{xk) = 7r(x). 
By construction of Fj/^, therefore, F{xk) = 0, i.e., Xk G Sp- But then also F{x) = 0 
since Sp is G-invariant. 

(iii) Suppose that N = tt~^{N) and let x G iV and g G G such that y := g- x exists. 
Then TT{y) = tt(x) G N, so y G 7r~^(JV) = N. Thus N is G-invariant. 

Conversely, if iV is a G-invariant regular submanifold of M and U ^ M denotes 
the action of G on M (cf. 13.1.11) . then U' := Li (1 {G x N) is open in G x iV and 
contains {e} x N. Also, $: G' M is smooth. Moreover, $(W') C N since N is 
G-invariant and because N is regular it follows that ^: 14' ^ N is smooth, hence 
is a local transformation group. Let x G N and let S be the G-orbit of x. Then S 
is a regular submanifold of M (bv 13.1.6]) and is contained in N. Thus by [TTJ 13.6] 
it is an immersive submanifold of N. Also, it carries the trace topology of M, as 
does N, so in fact it carries the trace topology of N, i.e., it is a regular submanifold 
of N. Furthermore, if H is a neighborhood basis of x in M such that S' DU is 
connected in S' for every orbit S' and every U G B then {(7nlV| U G B} has the 
same property in N (since it carries the trace topology of M), so G acts regularly 
on N as well. We are therefore in a position to apply 13.1.191 to ^ W ^ N and 
obtain that 7r|vr : N ^ N/G is a. surjective submersion and N/G = tt{N) =: N 
is an (n — fc)-dimensional manifold. Since N is G-invariant, if y G 7r“^(A^) then 
7r(y) = 7r(x) for some x G N, so y G N, i.e., N = 7r“^(7V). 

It only remains to prove that is a regular submanifold of M/G. To see this, we 
will proceed analogously to the proof of 12.2.141 Let x G N, let S be the orbit of 
X (so S C N), and pick vi,... ,Vk G g such that {xi(x),... ,w/c(x)} is a basis of 
A$(x). Next, choose an adapted chart (x = (y^, ■ ■ ■ W) around x in M such 
that ((y^,..., y"), WDN) is a chart of N, x{^) = 0 G M™, and x(fLnA) = {y"'*'^ = 


51 


















• • • = J/"* = 0}. Also, we may suppose that '^ 1 ( 0 ;), ...,Vk{x), 
a basis of T^N, and vi{x),... ,Vk{x), 


a 

dy"- 


IS 


a 

ay’’" 


is a basis oi T^M. Let 


f{t\ ...,n:= (Fi;’,^ o ... o Fi;’^)(x-1(0,..., 0, , n). 

Then / is a diffeomorphism from some neighborhood {—d, d)'^ of 0 G onto a 
neighborhood of x in M, and we take (p := f~^ on a suitable neighborhood U oi x 
to obtain that {U, (p) is a chart of M around x. Furthermore, since x is an adapted 
chart and N is C?-invariant, 




maps into N and can (by making d smaller if necessary) also be assumed to be a 
diffeomorphism. This means that {(f, U) is also an adapted chart for N. 

Since S is an orbit of G, 


y€S^Y\llo...o¥V;t{y)€S 

for all y and where the right hand side is defined. Therefore, for any 

y = f{t ^,..., t™) € U we have 

y = f{t\ ...,nGS^ /(O,..., 0, ...,nGS. (3.3.4) 

This means that U D S is the disjoint union of connected sets of the form 

Uc ■■= {y G U \ x'^+^iy) = Ck+i,.. • ,a;™(y) = Cm] 

where c = {ck+i, ■ ■ ■ ,Cm) is constant. Since G acts regularly, we may suppose in 
addition that 17 fl S' is connected. Then since /(O,..., 0) = x S S it follows that 
the only non-empty Uc is Uq, so S DU = {y G U \ x^+^(y) = • • • = x”^{y) = 0}. 

Summing up, we have that ( 93 , U) is a flat chart for G on M and {p, U (1 N) is a 
flat chart for G on N. Since G acts regularly, the proof of 13.1.171 shows that we 
may in addition assume that U is so small that {(p, U) is regular for A$ on M and 
{p, UnN) is regular for A<i, on N. Then bv l3.1.201 on U' = ^{U) we obtain a chart 
p' for M/G around 7 r(x) by 

p':y'^ {x'^+\y),x^{y)), U' ^ 

with y any element of Tr~^{y') n U. By the same reasoning, 

y' ^ (x'^+i (y),..., x"(y)), U'n N ^ 

(with y any element of H U (1 N) is a chart for N = N/G around 7 r(x). 

This shows that {p',U') is a chart adapted to N, i.e., iV is a regular submanifold 
of M/G. □ 


3.4 Dimensional analysis 

In a typical physics problem, there are certain fundamental physical quantities, such 
as length, time, mass, ..., which can all be scaled independently of each other. Let 
z^,... ,z'^ denote these quantities, then this scaling can be described by the action 
of a scaling group 

(z\...,z’') {Xiz\...,XrZ^), 
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with fixed scaling factors A = (Ai,..., Xr) G R’’. The underlying group therefore is 
the r-th power of the multiplicative group M“'". 

Furthermore, there typically are certain derived quantities, such as velocity, accel¬ 
eration, force, density, ..., which also scale if the fundamental quantities are scaled. 
Call these quantities x = ..., cc™). Then the action of the scaling group on the 

derived quantities takes the form 

A - (x\...,x™) = (A““A“=V..A“"ix\...,A“^’"A“^™A“'-™x™), (3.4.1) 

where i = l,...,r, j = l,...,misa matrix of real numbers determined by the 
problem at hand. For example, if is length, is time and is mass, then the 
action of scaling on velocity x^ and force x^ are given by 

A • {x^^x^) = (AiA^^a;^, AjA^^Aaa;^). 

If a derived quantity is invariant under the corresponding scaling then it is called di¬ 
mensionless. To describe some physical situation, one often has functional relations 
of the form F{x^,... ,x"^) = 0 for the derived quantities. Such a relation is called 
unit-free if it remains unchanged under a rescaling of the fundamental quantities. It 
turns out that such unit-free relations are often of great physical significance. The 
Buckingham Pi-Theorem shows that any unit-free relation can be written solely in 
terms of dimensionless quantities. 

3.4.1 Theorem. (Buckingham Pi-theorem) Let z ^,..., z*" be fundamental physical 
quantities that scale independently according to z* i—AiZ*. Let x^,..., x™ be derived 
quantities that scale according to (13.4.1|) for some (r x m)-matrix of constants A = 
(aij) and let s be the rank of A. Then 

(i) There exist m — s functionally independent dimensionless ‘power products’ 

TT^ = fc=l,...,m —s (3.4.2) 

such that any other dimensionless quantity can be written as a function of 
TT^,..., The columns of the matrix B = {(djk) in (13.4.21) can be taken 

to be any basis o/kerA. 

(a) If F{x^,... ,x™) = 0 zs any unit-free relation among the given derived quan¬ 
tities, then there is an equivalent relation F = 0 which can be expressed solely 
in terms of the above dimensionless power products: 

F{x) =0^ F{'k\x),...,tt'^-’^{x)) = 0. 

Proof. The proof will basically be an application of 13. 3. Ill As the underlying 
manifold M we take the positive octant in K™, so 

M = {x = (x^,..., x™) I X* > 0, z = I,..., m}. 

Then the multiplicative group G = (R+)’' acts globally on M via ()3.4.II1 . Since 
the action is multiplicative, its generators are found by differentiating (13.4.Ill with 
respect to Ai and then setting Ai = • • • = A^ = I: 

d d d 

Vr = OiiX^TTT + -1" z = I,..., r. 

ox^ ox‘‘ ax’” 

It follows that the dimension of the span of vi,... ,Vr is precisely the rank of A, 
namely s, so G has s-dimensional orbits. For a function / to be a global invariant 
of G, bv 13.2.71 it has to satisfy 

Viif) = 0, z = I,...,r. 
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In particular, if / = TTfc is given by (13.4.21) . then this condition is satisfied if and 
only if the exponents f3jk satisfy the system of linear equations 

m 

^Q;y/3jfc = 0, i = l,...,r. (3.4.3) 

Since rk(A) = s, there are m — s linearly independent solutions to this system. 
Choosing these solutions for the fijk, let tt := (tt^, ... ,7 r’"“'*). Then the Jacobian 
of TT at X = (1 ,..., 1) is the transpose of the matrix {/3jk), j = k = 

1,... ,m — s, and its rank therefore is m — s. For any a > 0 we have (as matrix- 
multiplication) 

7 r(ax^,x^,... jX"^) = diag(a^“,... ,a^i('"-'’>) • 7 r(x^,x^,... ,x™), 

so since iJ7r(l, !,...,!) has a non-zero {m — s) x (m — s)-sub-determinant, the same 
is true of Dir^a, !,...,!). Analogously we can argue for varying the x^-component, 
and so on. Altogether, it follows that the rank of the Jacobian of tt is {m — s) every¬ 
where on M. Thus by (the easy direction of) 13.3.51 (tt^, ..., tt™”'*) is a functionally 
independent system of invariants of G on M. 

Next we show that the orbits of G are precisely the level sets of the function tt. To 
see this, let x, x be any two points in M. Then by the definition of M there exist 
exponents tj, j = 1,... ,m such that = e*-* • x^ for all j. Therefore 

7 r*(x) = ^P 2 kt 2 _ ^ ^ (x) 

m 

Pjktj = 0, 

J = 1 

so X and x lie in the same level set of tt if and only if the tj satisfy 

m 

'^I3jktj=0, fc=l,...,m-s. (3.4.4) 

i=i 

By construction, the columns of the m x (jn — s)-matrix B := {(3jk) form a basis of 
ker A, so in particular A ■ B — 0. Moreover, (13.4.41) means that B^ ■ t = 0, where 
t = , t™)^, i.e., t G ker{B^). We claim that this, in turn, is equivalent to the 

existence of si,... ,Sr with 

r 

tj='^Siaij, j = l,...,m (3.4.5) 

i.e., to t S im(A^). This means we have to show that im(A^) = ker(i3^). Now if 
w = A^u € im(A^), then B^w = B^A^u = (AB)^u = 0, so im(A^) C ker(i?^). 
On the other hand, dimim(A^) = dimim(A) = s, and also dimker(i3^) = m — 
dimim(i3^) = m — dimim(i3) = s, so indeed we have equality. 

Hence two points x and x in M have the same image under tt if and only if x = Ax, 
where Xi = i.e., if and only if they lie in the same orbit under G. Since tt has 

maximal rank m — s everywhere, its level sets are regular submanifolds of dimension 
s, so G has s-dimensional regular submanifolds as orbits. Moreover, the proof of 
13.3.81 shows that, around any point, tt^, ... can be completed by a set of s 

parametric variables to form a flat chart for the group action of G. In the present 
situation, such parametric variables can even be chosen globally on all of M : indeed, 
pick numbers (3jk, j = l,...m, k = m — s-|-1 ,...,to that supplement the matrix 
B from above to obtain an invertible m x m matrix B and set 

m m 

n{x) := (]^(x*)^*\...,]^(x*)^*"*y 
2=1 2=1 
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Then the first k components of if are precisely tt, and the remaining m — s com¬ 
ponents are our new parametric variables. By the same reasoning as above for tt 
it follows that if has rank m everywhere on M, hence is a local diffeomorphism. 
In addition, if is injective: to see this, note first that if(a:) = if(j/) implies x = y 
if and only if if*(a:) = 1 implies x* = 1 for all i. Now if if® (a;) = 1 for all i then 
taking logarithms it follows that W ■ (ln(a:^),... ,ln(a:™))^ = 0, which due to the 
invertibility of B indeed implies a;® = 1 for all i. Thus we obtain the desired global 
chart if for M. 

Any orbit intersects this chart in a single slice (tt = const.), so the action of G 
is regular. Moreover, bv 13.1.21)1 tt induces global coordinates on M/G, hence can 
be identified with the quotient map M ^ M/G (and M/G can be identified with 
^jjgj-gfQpg jjj ^}jg position to aoplv IXd.llI Since being G-invariant by 
definition is the same as being dimensionless, (i) follows from l3.3.11l fil. Moreover, 
an equation is unit-free if and only if it is G-invariant, so (ii) follows from 1573.111 
(ii). □ 


3.4.2 Example. The energy yield of the first atomic explosion. In 1947, when 
the amount of energy released by the first atomic explosion was still classified, G. 
Taylor calculated this energy using dimensional analysis (see m, our presentation 
follows [ 2 ). 

We model the explosion by the radius R of the spherical fireball emanating from 
the point of explosion. We assume that i? is a function 

R = (3.4.6) 


where 


x^ = E, the energy released by the explosion, 
x^ = t, the time elapsed since the explosion, 
x^ = Po, the initial ambient air density, 
x"^ = Pq, the initial ambient air pressure. 

R itself also is a derived quantity, so we set x^ = R. It follows that for this 
problem m = 5. The fundamental physical quantities needed to describe the derived 
quantities are = length, = mass, and z^ = time, so r = 3. The matrix 
A = has the form 


/ 2 0 -3 -1 1\ 

A = 1 0 1 1 0 

\-2 10-2 0 / 

E.g., energy has dimension length^ x mass/time^, hence the first column. The rank 
of this matrix is s = 3, so m — s = 5 — 3 = 2. Therefore, bv 13.4.11 there are two 
functionally independent dimensionless power products tt^, tt^, in terms of which 
any dimensionless quantity can be described. By (13.4.2L these power products are 
of the form 


TT^x) = (xy^^(xy^^(x^f^^(xy^^(x^f^^ 

7r^(x) = . 
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From the proof of l3.4.1l we know that the 5x2 matrix B = {Pjk) can be constructed 
by finding a basis of ker A. It follows that 


so 


B = 


(-2 - 1 \ 
6 -2 
-3 1 

5 0 

VO 5/ 


7r^(a;) 




{x^)-\x^)-^{x^){x^f -- 


■ E^pI 
R^Po 
Et^ ■ 


Bv l3.4.1h ii'l. the relation (13.4.611 can equivalently be expressed in the form F(7r^, tt^) = 
0, and we assume that in fact we can solve for tt^. Thus there exists a smooth func¬ 
tion g of TT^ such that p.4.6p is equivalent to = g{Tr^), i.e., to 

j g{n^). (3.4.7) 


for some function g. By continuity, g(7r^) m g{0) for tt^ small, so Taylor derived the 
approximative formula 




with A = 



Here, po is known, and to find g{0) one can plot logj^Q R versus logj^g ^ using data 
from experiments with conventional explosives (whose E is known). Taylor then 
used motion picture records of the first atomic explosion, due to J.E. Mack, to plot 
5/2 logj^g R versus logj^g t (in c.g.s.-units, but as we know any other system of units 
would have given the same result!), to obtain an accurate estimate for E: 

E Ri 7,14 X 10^° ergs = 7,14 x lO^V, 


which corresponds to the energy release of about 16800 tons of TNT. 


3.5 Groups and differential equations 

We now want to start applying symmetry methods to differential equations. To 
this end we consider a system S of differential equations involving p independent 
variables x = {x ^,..., x^) and q dependent variables u = (m^, ..., u‘^). Solutions of 
S will be functions u = f{x), or, in components, 

■u“ =/“(xV ... ,xP) {a = l,...,q). 

Henceforth we will use the convention of using Latin indices for the independent 
variables and Greek indices for the dependent ones. We let X = be the space of 
independent variables and t/ = K'* the space of dependent variables. Then basically 
a symmetry group of S will be a local transformation group G acting on M := XxU 
in such a way that it ‘transforms solutions of S into solutions of S. 

We first need to clarify how such a group of transformations is to act on a function 
/. The key idea here is to identify /: X D —>■ 17 with its graph 

T/ = {{xj{x)) I a: e G}. 
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Then T/ is a p-dimensional submanifold of M. li g G G and Tf Q Ug, the domain 
of g, then g acts on T/ by 

g-Tf := {(i.u) =g - {x,u) \ (x,u) G T/}. 

Note that g-Tf will in general no longer be the graph of a function. However, 
since G acts smoothly and e • Ty = F/, by shrinking H and restricting to elements 
of G near to e we can always achieve that g-Tf = T where / (corresponding to 
u = f{x)) is a well-defined function. In this case we write f = g ■ f and call / the 
transform of / by g. 

3.5.1 Example. Let X = M., U = so p = g = I and again consider the rotation 
group G = SO{2) on M = X X U = The action of G is given by 


{x,u) = £ ■ {x,u) = {x cos £ — u sin e, x sin £ + u cos e) 


(3.5.1) 


Now if /: a; I—u = /(x) is a function then G acts on / by rotating its graph, 
and in general this rotated graph will not be the graph of a well-defined function: 
think of a straight line or a parabola. The example of the parabola in particular 
demonstrates that it will in general not suffice to choose g near e (i.e., e near 0) but 
that one has to also restrict the domain of /. But for |e| small and restricting / to 
a sufficiently small domain we do obtain a well defined function £•/ = /. 

To explicitly calculate the action of G on a linear function, let u = f{x) = ax -\-b. 
Then any point in F/ is of the form (x, ax + b), which is rotated by e to the point 

(x,u) = (x cos e — (ax + b) sin e, x sin £ -f (ax + b) cos e). 

To determine u = f(x) we have to eliminate x from this equation, which is possible 
for £ small (so that cot£ ^ a). Then 


X = 


X + b sin £ 


cos £ — a sm £ 


and therefore 


sin £-I-a cos £ _ 
u = j(x) = - ^—X ■ 


cos £ — a sm £ cos £ — a sm £ 
which, as expected, is again a linear function. 

The general procedure for calculating f = g ■ f from / is as follows. Write 

(x,u) = g-(x,u) = (^g(x,u),^g(x,u)), (3.5.2) 

with Sg, smooth. The graph T ^ = g - T f of/ = (/• /is then given parametrically 
by the equations (for x € fl) 

X = Eg(x, f(x)) =EgO (id X f)(x) 
u = '^g(x, f(x)) = o (id X f)(x) 

To calculate / we have to eliminate x from these equations. For g = e, Seo(id x /) = 
id, hence for g in a neighborhood of e the Jacobian of Eg o (id x /) is nonsingular 
and so by the inverse function theorem we can locally solve for x: 


Consequently, 


= [SgO(idx/)] l(i). 


• / = [^s o (id X /)] O [Eg o (id X /)] b 


(3.5.3) 
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3.5.2 Lemma. Let f be a local smooth map, and let g, h G G. Then 

^ ' {9 ' f) = (h ■ g) ■ f, wherever defined. 

Proof. Since any function is uniquely determined by its graph it suffices to note 
that 

■ 9 )^f = h - {g ■ F/) = h ■ Tg.f = 

□ 

An important special case where / is automatically defined unrestrictedly is that 
of a projectable group action. For such actions, Sg is a function of x only, i.e., 

{x,u) = g- {x,u) = {Eg{x),<^g{x,u)), 

and so f{x) = [<I)g o (id x /)] o Using this terminology, we can now define: 

3.5.3 Definition. Let S be a system of differential equations. A symmetry group 
of S is a local transformation group G acting on an open subset of M = X x U 
such that whenever f is a solution of S and g ■ f is defined for some g G G then 
also g ■ f is a solution of S. (Here solution means any smooth solution defined on 
any subdomain fl of X.) 

3.5.4 Example, (i) Let S consist of the equation Uxx = 0. The solutions are 
precisely the linear functions on M. Since G = SO{2) transforms linear functions to 
linear functions, it is a symmetry group of S. 

(ii) Let S = {ut = Uxx}, the heat equation, and consider the group action 
{x,t,u) 1 -^ {x + ea,t + sb,u) (e G K). 

This is a symmetry group of S since f{x — ea,t — eb) is a solution of the heat 
equation whenever / is. 

One immediate advantage of knowing the symmetry group of a differential equation 
is that it gives a straightforward way of constructing new solutions from known ones 
(sometimes even from trivial ones, e.g., constant solutions) simply by applying the 
group transformations. Our aim ultimately is to derive infinitesimal criteria for 
a local transformation group to be a symmetry group of a differential equation, 
which can then be used to explicitly calculate the full symmetry group of any given 
differential equation. For this purpose we first need to develop some more technical 
machinery. 


3.6 Prolongation 


In the previous section we identified any smooth function with its graph in order 
to define the action of a transformation group on it. Since we are now interested in 
differential equations we need to find a way of simultaneously considering a function 
and its derivatives up to a certain order. The mechanism we are going to employ is 
that of prolongation, a simplified version of the theory of jet bundles. 

Given a smooth function f: W ^ M., x i-G- f{x^,... ,x'p) and any k G Ng, we let 
J = (ji,..., jfc) be an unordered /c-tuple with 1 < ji < p for all 1. The order of this 
tuple is defined as jl J = fc, and the corresponding partial derivative of / is 


d.jf{x) 


d’^f 

dxA dxL ... dxA 
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There are 

(p + k- l\ 
k ) 

different possible fc-th order partial derivatives of /. To see this, note that since the 
order of derivatives is irrelevant we may first bring J in ascending order, and then 
uniquely describe any possible choice by writing * for each number that appears in 
J and I to signify that we increase the number. For example, if p = 5 and k = 2, 
the string * * |||| corresponds to the tuple (1,1), | * | * || to (2,3), and (||| * |*) to 
(4, 5), etc. We need k stars and p—1 vertical lines, so our problem is equivalent to 
determining the number of possible selections of k elements from ap— 1 + fc-element 
set, hence the above formula. 

li f: X ^ U with X = and U = R®, so u = f{x) = (/^(x),..., f‘>{x)), then we 
need q-pk numbers Uj = djf°‘{x) to represent all possible fc-th order derivatives of 
/ at any point x. Therefore, we set Uk '■= and write Uj, with a G {1,..., g} 
and J any unordered multi-index as above. Moreover, we let 

[/(") ■= U X Ui X ■ ■ ■ X Un. 

The coordinates of can be used to represent all the derivatives of any map 
/: X —>■ [/ of orders 0 to n. The dimension of is 

We will denote elements of by with q ■ components Uj, with a € 
{1,... ,( 7 } and J = {ji,..., jk) an unordered multiindex, 1 < ji < p, 0 < k < n. For 
k = 0 there is only one such multiindex, denoted by 0 and Uq is the component u°‘ 
of u. 

3.6.1 Example. Let p = 2, g = 1. Then X = R^ has coordinates (x^,x^) = {x,y), 
and {7 = R has the coordinate u. Ui equals R^, with coordinates {ux, Uy), represent¬ 
ing all first order derivatives of u. Also, U 2 = R^ has coordinates {uxx,Uxy,Uyy), 
representing all second order derivatives of u. In general, Uk = R^^^ since there 
are k + 1 derivatives of u of order k, namely gJ^Qyk-i ; i = 0,... ,k. Also, = 

U X Ui X U 2 = R®, with coordinates = {u;Ux,Uy-,Uxx,Uxy,Uyy), representing 
all derivatives of u with respect to x and y of order at most 2. 

Now for any smooth function f: X ^ U we define its n-th prolongation pr*^")/: X —>■ 
[/("), m(") = pr(”)/(x) by 

u'} = djr{x). 

Thus for any x, pr*^")/(x) is a vector with q -p^"^ entries representing the values of 
/ and all its derivatives up to order n at x. In this sense we might also identify 
pr("^/(a;) with the n-th Taylor polynomial of / at x. 

3.6.2 Example. For p = 2, g = 1, we have u = f{x,y). Then the second 

prolongation of /, = pr(^^/(x,p) is given by 

{U',Ux,Uy',UxXjUxy,Uyy) = ( / ! /, /, 9,^,/, 9^;^/, /) , 

evaluated at (x,y). 

The space X x is also called the n-th order jet-space of A x [/, and pr(”^/(a;) 
is called the n-jet of / at x. If M is an open subset of A x [/, then we set 

:= M X Ui X ■ ■ ■ X Un. 

If n = f{x) is a function whose graph lies in M then the n-th prolongation pr^"^/ 
is a function whose graph lies in 
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3.7 Systems of differential equations 


For a system S of n-th order differential equations in p independent and q dependent 
variables we write 

P,(x,u(”)) = 0 = 

where x = , x^), u = {u^ ^, u'^). Here the functions 

are assumed to be smooth, i.e., P € C°°{X x C/("\K^). The differential equa¬ 
tions in S determine where this map vanishes on X x i.e., they determine a 
corresponding subvariety 

5p = {(x, ) I P(a;,«(”)) = 0} C X X C/(”) 

of the total jet space X x We may therefore identify the system S with this 

set Sp, and we shall do so at many places below. 

A solution of 5 is a smooth function u = f{x) such that 
Pi.(a;,pr(”V(a;)) = 0 , v = 

for all X in the domain of /. Geometrically, this means that the graph of the 
prolongation pr^"^/ is contained in iSp: 

= {(a;,pr("-)/(a:))} QSp = {P{x,u^'^'^) = 0}. 


3.7.1 Example. Consider the Laplace equation in two variables: 

'^xx T ^yy ~ 0. (3.7.1) 

Here, p = 2, q = 1, and n = 2 since the equation is of second order. The coordinates 
of A X are {x, y, u\ Ux,Uy', u^x, Uxy, Uyy), and (13.7.111 describes a hyperplane Sp 

in A X A function / is a solution of (13.7.111 if the graph of pr^^^/ is contained 

in Sp. For example, let f{x,y) = x^ — ixy^. Then 

{x, y, y)) = {x, y\x^ - ixy^; - 3?/^, -&xy\Qx, -6j/, -6x) e Sp. 

3.8 Prolongation of group actions 

In this section, given a local transformation group G on an open subset M of X xU, 
we will construct an induced action pr^^^G, the n-th prolongation of G, on 
The idea is to define pr^^^G in such a way that it transforms the derivatives of any 
smooth function u = f{x) into the corresponding derivatives of the transformed 
function u = fix). 

Thus let (ccojWo"^) ^ and choose any smooth function u = f{x) defined in a 

neighborhood of xq whose graph lies in M and such that it has the given derivatives 
at xo, i.e., such that = pr("’)/(xo). To see that such an / indeed exists, one 
may take it to be an appropriate Taylor polynomial: 

Fix) ■='^^ix - xo)-^, a = l,...,q. (3.8.1) 

J 
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Here the sum is over all J = (ji,..., j^) with 0 < k < n, and 

{x — xqY := (x^^ — x^^)(x^^ — xi^)... {x^'^ — x^q ). 

Also, for given J we set J := (ji,..., jp), where ji equals the number of j^’s which 
equal i. For example, if J = (1,1,1, 2,4,4), p = 4, fc = 6, then J = (3,1,0, 2). 
Finally, J = ji\...jp\. 

If g is close to e then the transformed function f = g ■ f (as in (13.5.311 1 is defined in 
a neighborhood of (xo,uo) = g ■ (xo,uo), where uq = f(xo). We then define 

where 

4"^ = ■ /)(io)- (3.8.2) 

An important point to note here is that, due to the chain rule, the value of pr("^p • 
(cco, Wq"^) depends exclusively on the derivatives of / at the point xq up to order n, 
i.e., only on (xq, Uq"^) itself. It is therefore independent of the choice of the auxiliary 
function / from above and therefore provides a well-defined operation on M^'^\ 

3.8.1 Lemma. For any local transformation group G on M as above, prl"lG is a 
local transformation group on 

Proof. We first note that {g,{xo,u^^)) !->■ pr^^^g ■ (a;o,Wo”^) is smooth. Indeed, 
taking the / from (13.8.11) this follows directly from (I3.5.3L To see this, it suffices to 
note that the inverse function theorem yields an inverse that automatically depends 
smoothly on all parameters that the original function depends upon (in our case, 
the if / = f{x, Oi) {a representing any parameters), then f~^ is the solution of 

the implicit equation F{x, y, a) = f{y, a) — x = 0, so the implicit function theorem 
yields the claim. 

It remains to verify (i) and (ii) from 13.01 Here, (ii) is automatic because for g = e 
we get g ■ f = f ■ Thus we are left with proving that 

pr(")h(pr(")g. (xo,4”))) =pr(")(/r-g)(xo,4”4 (3.8.3) 

To see this, let (xq, Mq"^) G and pick a smooth function / defined near xg with 

= pr("’i/(a:o). Then 

(io,4"'^) =g - (xo,Ug'"^) = (io,pr^"'^(g- /)(io))- 

To determine the left hand side of (I3.8.3P we have to find any smooth function F such 
that pr("')F(io) = pr("^((/ • /)(io), calculate h ■ F and then determine pr(”^h • F at 
the corresponding point. Naturally, we take F := g ■ f. Then using the terminology 
from (13.5.21) we obtain 

pr(”)/i(pr(”)p • (xo,4”4 = ( 2/1 (io, (ff • /)(io)), pr^"'^(/i ■ P)(S/i(io, (ff • /)(io)))) 
Here, 

2/i(io, {g ■ /)(io)) =^h{g- {xq, Uq)) = proji(/i • {g ■ {xq,uo))) 

= proji((h • g) ■ {xo,uo)) = Eh.g{xo,uo), 
and, bv 13.5.21 h ■ F = h ■ {g ■ f) = {h ■ g) ■ f, so altogether we get 

pr(”)h(pr(”)g- (xo,4”4 = i’E.h-gixo,uo),pT‘^'^\{h ■ g)f){Eh.gixo,uo))) 

= pr("')(/i-g)(a:o,4"4 

□ 
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3.8.2 Example. Again let G = SO{2) be the rotation group on M = = A x [/. 

We determine the first prolongation of G. We have p = q = 1, so A x =]R^, 
with coordinates (x, u, Ux)- If u = f(x) is a local smooth function then pr*^^)/(x) = 
(/(x), /'(x)). Now let (x°, G A X and let £ be an angle of rotation. We 

wish to determine 

prW£.(x°,n°,u°) = (x°,n°,u°). 

To this end, let / be the Taylor polynomial 

/(x) = + u°(x — x°) = • X + — w°x°), 


which satisfies /(x°) = /'(x°) = Then bv l3.5.1l we get 


/(x) = (£• /)(x) = 


sin e + it° cos e 
cos e — sin e 


— M°X° 


cos e — sm e 


which is well-defined for ^ cote. By (|3.5.1I) we have x° = x° cose — vP sine, so 
either again by (13.5.11) or by inserting we get 

vP = /(x°) = x° sin e + vP cos e. 


Finally, we get for the first derivative 


= /'(i°) 


sin e -I- cos e 
cos £ — sin e 


Combining this and dropping the 0-superscripts we obtain the action of pr(^^5'0(2) 
on A X 


(i) , . ( . . , sme-|-it 2 ,cose \ 

pr^ ’£ ■ (x,u,Ux) = X cose — usme, xsme -I- u cose,-:—, , 

\ cos e — Ma; sm e / 

defined for |e| < |arccot(Ma;)|. The important point to note is that although the 
action of SO{2) is linear and globally defined, its first prolongation is nonlinear and 
only locally defined. 


In the previous example we see that the first prolongation of G acts on the original 
variables (x, u) in exactly the same way as G itself. This is actually a general 
phenomenon. In fact, it follows directly from the definition of pr^^^G via evaluating 
derivatives of / that the action of pr^^^g on (x, ) with k < n coincides with that 

of pr*^^). In particular, pr^^^G = G. To give a concise formulation of this property 
we introduce the natural projections 

TT^ : 

^nx,7r(-)) = (x,u«). 


where (for k < n), consists of the components m“, U J < A: of For example, 
if p = 2, g = I, then 

TTQ{x,y;u;Ux,Uy;Uxx,Uxy,Uyy) = {x,y;u) 

: y : : Uy ^ Uxx 7 ^xy: Uyy') — 1 / 7 j ) ■ 


Then for any k < n and any g G G we have 


^^opr(")5 = prWp. 


(3.8.4) 


In particular, this observation allows to calculate prolongations of group actions 
inductively. 
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3.9 Invariance of differential equations 

In Section 1X71 we have identified any given system S of differential equations 
Pi,{x, = 0 {i/ = I,... ,1) with a subvariety Sp of This identification will 

allow us to utilize the methods of symmetry group analysis of algebraic equations 
in the context of differential equations. The essential translation result between the 
two realms is the following theorem 

3.9.1 Theorem. Let M be an open subset of X x U and let S: 

be an n-th order system of differential equations defined on M, with corresponding 
subvariety Sp of M^'^K Suppose that G is a local transformation group acting on 
M whose prolongation leaves Sp invariant, i.e., 

G 5p ^ pr(")g • G Sp, 

whenever defined. Then G is a symmetry group of the system S in the sense of 

roxi 

Proof. Let u = f{x), /: SI —>■ [/ be a local solution of P{x,u^'^'>) = 0. This means 
that the graph 

= {(a;,pr("V(a;)) | a; G fl} 

of pr*^"^/ is contained in iSp. If g G G is such that g ■ f is well-defined then, by the 
very definition of pA^^g, the graph of pr(”)(g • /) is given by the image of the graph 
of pr^”)/ under pr^^^g, i.e., 

Since Sp is invariant under pr^^^g, it follows from this that the graph of pr(")((ji • /) 
is contained in Sp as well. But this just means that g ■ f is a local solution of the 
system S. □ 


3.10 Prolongation of vector fields 


To make the calculation of symmetry groups of differential equations accessible to 
the infinitesimal methods we developed for algebraic equations we need to be able 
to effectively calculate the infinitesimal generators of prolonged group actions. 


3.10.1 Definition. Let M G X xU be open and suppose that v is the infinitesimal 
generator of a local one-parameter transformation group e i—>■ Fl^. Then the n- 
th prolongation of v, denoted by pr(”)i;, is the local vector field on which 

is the infinitesimal generator of the corresponding prolonged one-parameter group 
pr(")Fl^.- 

d 


pr(”)(Fl")(x,u(”)), 


= -T 

at Q 

for any (x,u<^"^) G 

Note that from this definition, together with 13.1X1 it immediately follows that 


pr(")(Fi«) =Flf 


M, 


(3.10.1) 
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Any vector field v on M = X x U can be written in the form 

p <? 

'v\(x,u) = '^C{x,u)d^i + y] (j)a{x,u)du<-. 

i—1 Oi—l 

Its n-th prolongation, being a vector field on therefore has to be of the form 

= '^C{x,u)d^. +'^'^cl)i{x,u'''^'>)duj, (3.10.2) 

i—1 Q—1 J 

where the last sum is over all J with 0 < ftJ < n. Note that due to (13.8.411 the 
prolonged group action agrees with the original one on = M, so the coefficients 
and (/)° = (j)a agree with those of v. Moreover, for the same reason, if (jj = fc 
then the coefficient </>;( of du‘^ will only depend on /c-th and lower order derivatives 
of u, (j)i = i.e., 

r 7 r(l(pr(’")u) = pr('=)u n>k, (3.10.3) 

where pr*^'’^ = v. Again this reflects the possibility of calculating the coefhcients 
recursively. Our main goal will be to derive a general formula for calculating the 
4>i given (j)a. 

3.10.2 Example. Returning once more to the rotation group G = SO{2) on 
from 13.5m 13.8.21 with infinitesimal generator v = —udx + xdu and group action 

FI" {x,u) = £ ■ (x, u) = (x cos £ — u sin e, x sin £ + u cos e) 


we have 

\ f • • sine + itj; cos e \ 

pr^ ’ 1L (x, u, Ux) = X cose — usme, xsme + u cose,-, , 

\ cos e — Mx sin e / 

By 13.10.11 the first prolongation of v is calculated by differentiating this expression 
at e = 0. This gives 

pr^^^n = -udx + xdu + (1 + ul)du^ (3.10.4) 

As predicted by (13.10.311 . the first two terms coincide with those of v itself. 

In order to be able to apply the criterion 13.2.131 to the present situation we need a 
suitable maximal rank condition for systems of differential equations. 


3.10.3 Definition. A system 

Pi,(x, u^"^) = 0 iy = l,...,l 

of differential equations is said to be of maximal rank if the I x {p + ) Jacobian 

matrix 




aa dp^ 


9x® ’ duf 


of P with respect to all its variables (x,u^'^^) is of rank I whenever Pfx^udd) = 0. 


3.10.4 Example, (i) The 2-dimensional Laplace equation P = Uxx + Uyy = 0 is of 
maximal rank since the Jacobian of P with respect to (x, y; u; Ux,Uy] Uxx, Uxy, Uyy) G 
X X is 

Jp = (0,0;0; 0,0; 1,0,1), 
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which has rank 1 everywhere. 

(ii) The equation P = {uxx + Uyyf' = 0 is not of maximal rank, because 
Jp = (0, 0; O 5 0, O 5 2(Uxx Uyy)-i 0, 2{^Uxx “f 
vanishes on P = 0 . 

3.10.5 Remark. In practice, the maximal rank condition is not much of a re¬ 
striction, i.e., ‘most’ systems automatically satisfy it. In fact, if Sp = | 

P{x, = 0} is a regular (/-dimensional) submanifold of then locally around 
any point in we can pick an adapted coordinate system (cf. [TOl 1 . 1 . 12 ]), and 

thereby (extracting the relevant coordinates) find a smooth map P of rank I such 
that Sp = Sp = {(x, u^"^) I P{x, = 0}. 

Using the above condition we can now formulate the main result on the calculation 
of symmetry groups of systems of differential equations: 

3.10.6 Theorem. Let 


Pi/(a:, u^"^) = 0 1^=1,...,/ 

be a system of differential equations of maximal rank defined on an open subset M 
of X X U. If G is a local transformation group acting on M and 

pr("):y[p^(a;^ y("))] = 0, z/ = l,...,/, whenever P{x,u^'^^) = 0 (3.10.5) 

for every infinitesimal generator v of G, then G is a symmetry group of the system. 

Proof. By 13.9.11 it suffices to show that Sp remains invariant under pr^^^G. By 
13.8.11 pr("^G is a local group of transformations on whose infinitesimal gen¬ 

erators are exactly the pr("^z;, for v the infinitesimal generators of G Ibv 13.10.1]) . 
Since P is of maximal rank, the result now follows from 13.2.131 □ 

We shall see in 13.12.41 that if P satisfies a certain local solvability condition then 
(13.10.51) is in fact necessary and sufficient for G to be a symmetry group of the 
system. 

The Hadamard Lemma l3.2.15l allows the following equivalent reformulation of (13.10.5^ : 
there exist smooth functions = Qvy,(,x,u^"''>) such that 


i 

pr(«)^,[p^(a;^y("))] = '^Qvffx,u'^'^^)Pffx,u'^‘^^) (3.10.6) 

holds identically on 

3.10.7 Example. Continuing our analysis of the rotation group 50(2) on from 
13.10.21 consider the first order ODE 

P(x, u, Ux) = {u — x)ux -b u -b a: = 0. (3.10.7) 

The Jacobian of P is 

Jp = (dxP, duP, du^P) = {I - UxA + Ux, u - x), 
hence has rank 1 everywhere. 


65 




















We now apply the infinitesimal generator pr^^^^ti from (13.10.41) to P: 

pr(i)^;(p) = _|_ (1 

= -u(l - Ux) + x{l + Ux) + (1 + UxY{u - x) 

= Ux[(u — x)Ux + U + x] = UxP- 

Therefore, (13.10.511 (or also (13.10.61) 1 is satisfied, and we conclude that the rotation 
group transforms solutions of (|3.10.7I1 to other solutions. 

The calculation of even the first prolongation of the infinitesimal generator of the 
rotation group in 13.8.21 demonstrates that the direct method of first determining 
the prolonged group action and then calculating the corresponding generators is 
not feasible in practice. We need to find an algorithmic way of directly calculating 
prolongations of vector fields, avoiding the detour of determining the prolonged 
group action along the way. 

Before deriving a general formula, let us start out by considering some special cases 
first. Let 

V = ^C{x)dxi 

i=l 

on M C X X U, with U = M.. According to (I3.5.2p . the corresponding group action 
(/e = Fig then is of the form 

(i, u) = ge- (x, u) = {Es{x),u), 


where 


de 


%{x)=C{x). 

0 


(3.10.8) 


On we have coordinates = {x'^,u,Uj) with Uj := dxiU. By definition, 

to calculate pr^^^u in (x,u^^'>), we may take any function / with u = f{x) and 
Uj = dxif{x) for j = 1,... ,p. Then 




where x = ^e(x), u = u, and Uj are the derivatives of the transformed function 
fs= 9e- /, which, by (13.5.31) , is given by 

U = fe{x) = /(S-I(i)) = /(S_e(i)). 

Therefore, 

= !&<*> = £ 

fc=i 

Since S_£(i) = x, this simplifies to 


^ d'E.^ 

k=l 

This is the explicit formula for the action of on the Uj-variables. To find the in¬ 
finitesimal generator of pr^^^g^, we have to differentiate this expression with respect 
to e at e = 0. Also, since pr^^^g^ acts on {x,u) as g^, the coefficients of dxi and du 
in pr^^^u stay the same. Hence 

pr(i)i; = ^C{x)dxi +^(ly’{x,u^^'^)du^, (3.10.10) 
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where 




de 


P 




0 h= 


k=l 


Here, using (13.10.81) . we have 


A. 

de 


0 L 


a: 




dx^ 


-i^eix)) 


= -fejW + L 






- £=0 


and the second term vanishes since .no = id. Therefore, 

(jP (X, U, U:r) = 


k=l 


(3.10.11) 


The second special case we look at is a local group action as above, only this time 
acting exclusively on the dependent variable: 

(x, u)= Qe- (x, u) = {x, $£(a:, u)). 

In this case, v = (j){x, u)du, where 

d 


(t>{x,u) = 


de 


^e{x,u). 


0 


Let / be a local smooth function with f{x) = u. Then f^=g^-fis given by 

U = fsix) = <^eix,f{x)). (3.10.12) 

To determine the prolonged group action, we need to differentiate /g: 


dfe 


5$, 


5$, 


df 


Consequently, pr^^^^g^ • (x,u^^'>) = (a;,it(^^), where 


a^E a^r 


" axi + au 


(3.10.13) 


As before, to determine (jP in 


= V + (jP{x,A^^)au 
1=1 


we have to differentiate (13.10.131) at e = 0: 


(jP{x,A^'^) = — 


acj) a4> 


(3.10.14) 


We have here the first instance of a generally useful operation, namely that of total 
derivative: 

(jPix,A^^) = -^[^{xj{x))]. 

Thus <jP(x, u'P'^) is calculated from (j)[x, u) by differentiating with respect to x^ while 
treating u as a function of x. Accordingly, the total derivative of 4> with respect to 
x^, Dj is defined as 

n ^ - AALx. At 

More generally, we define: 
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3.10.8 Definition. Let {x,u^^'>) i—>■ P{x,u^'^'>) be a smooth function of x, u, and 
derivatives of u up to order n, defined on an open subset of X x The 

total derivative of P with respect to a;* is the unique smooth function DiP{x,u^'^~^^'^) 
defined on with the property that if u = f{x) is any smooth function of x 

then 

AP(a;,pr("+i)/(a;)) = f[x))]. 

Thus DiP is calculated by differentiating P with respect to x* while treating all 
and their derivatives as functions of x. 


3.10.9 Lemma. For any P(x, we have 


D,P = 


dP 

dx'- 


EE "3,. 

a=l J 


dP 

duf 


where, for J = (jd,... ,j^), 

a _ ^ 

dx^dxL ... dxti^ ’ 

and the sum in (I3.10.15[l extends over all 0 < jjJ < n. 
For example, with X = U = R we have 


(3.10.15) 


(3.10.16) 


^ ^ dP dP dP dP dP 

DxP — I" I" '^xxT{ I" '^xyT{ ^ '^xxxT{ h ■ ■ • 

OX OU OUx OUy OUxx 

^ ^ dP dP dP dP dP 

PyP — I" ^ ^ ^ '^xxyT{ h ... 

dy du dux duy duxx 

Higher order total derivatives are defined inductively: let J = {ji, ■ ■ ■, jk) be a fc-th 
order multiindex with 1 < Jk < p for each k, then we set 


Dj Dj,^Dj2 ■.. Djj,. 

Also, note that the order of differentiation does not matter for total derivatives: 
DjDi = DiDj for all j, I. 

In the proof of the general prolongation formula we will need the following auxiliary 
result: 


3.10.10 Lemma. Let e i—> M(e) be a smooth map that takes values in the space of 
invertible n x n-matrices. Then 

i[M(e)-l = -M(c)-«M(e)-‘ 

Proof. It suffices to differentiate the identity M{s)M{s)~^ = / by the product 
rule. □ 


3.10.11 Remark. As a final prerequisite for the proof of 13.10.1^ below we need 
the observation that given any open subset M oi X x U, one can view the (n + 
l)-st jet space as a subspace of the first jet space of the n-th 

jet space. In fact, any (n + l)-st derivative uf can be viewed as a first order 
derivative of an n-th order derivative (which, in general, can be done in many 
different ways). For example, let p = 2, g = 1, so that the coordinates on 
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can be written as {x,y;u;Ux,Uy). Then we may view {ux,Uy) as new dependent 
variables, Ux = v, Uy = w. This turns into a subset of X x IJ, where X is 
still but U has three dependent variables u, v, w. Consequently, the first jet 
space of turns into an open subset of X x with coordinates 

(x, y; u; v; w; Ux, Uy, Vx, Vy,Wx, Wy). Since we defined v = Ux, w = Uy it follows that 
M(2) C is the subspace defined by the relations 


V =Ux, W = Uy, Vy = Wx 


in X X . Here, the third relation is a consequence of ' 


^xy — ^yx' 


3.10.12 Theorem. (The general prolongation formula) Let 

V = '^C{x,u)-^ + '^(l)aix,u) 


a=l 


du° 


(3.10.17) 


be a vector field on an open subset M Q X x U. The n-th prolongation of v is the 
vector field 

= u + (3.10.18) 

a—l J * 


defined on C X X where J runs over all multi-indices J = (ji, ■ ■ ■, jk) 

with 1 < Jk < p, 1 < A: < n. The coefficient functions (f'f are given by 




(3.10.19) 


i=l 


i=l 


where uf = dxiU°‘ and = dx^uf. 

Proof. We start out with the case n = 1. Let g^ = FI” and 
{x, u)= ge- {x, u) = (Se(a;, u), $e(a;, u)). 

Then 

d 


C{x,u) = 


de 


,{x,u) = — 
de 


lix,u), i = l,...,p, 


$“(x,u), a=l,...,q. 


(3.10.20) 


Given any {x,u^^^) S pick a local smooth map / with pr(^^/(x) = i.e., 

with u“ = f°‘{x), uf = dxif°‘{x). By (I3.5.3L for e small and restricting the domain 
of / sufficiently, we have 

u = Mx) = {ge ■ f)ix) = [«>e O (id X /)] O [S^ O (id X f)]~^{x). 

By the chain rule, we conclude from this that for the Jacobian Jfe = (9/“/9i®) we 
have (using that a; = o (id x /)]“^(i)) 

Jfe{x) = J[<^e o (id X f)]{x) ■ [J[E^ o (id X f)]{x)]~'^ (3.10.21) 

The matrix entries of Jf^ix) then give formulae for pr^^^^p^. 

To calculate pr^^^u we have to differentiate (13.10.211) at e = 0. In doing this, note 
that since po = id, we have 

'^o{x,f{x))=x, ^o{x,f{x))=f{x), (3.10.22) 


69 








so with I the p x p unit matrix we get 


J[^o o (id X f )]{x) = /, J[$o o (id x f)]{x) = Jf{x). 


Therefore, using Id. lO.lHl 


de 


Jfsix) 

0 


de 


J[$£ o (id X /)](a;) - Jf{x) ^ 

0 


J[Se o (id X f )]{x) 

0 


J[(/)o(idx /)](a;)-J/(a;)-J[Co(idx f)]{x). 


The matrix entries of this expression give the coefficient functions of du<^ in 
The (a, fc)-th entry is 


(l^tix,pT^^'^f{x)) = ^[cj)^{x,f{x))]-J2^- -^{CixJixm 


2=1 


or, in terms of total derivatives, 


4>^{x,U^^'>) = Dk4>a{x,U'^'^) -'^DkC{x,u^^'>)u7 


2 = 1 


= Dk 




2 = 1 


(3.10.23) 




2 = 1 


with = &^u°^ldx^dx\ This is (ld.10.191) for n = 1. 

In the general case we proceed by induction. Here we make use of the key observa¬ 
tion [^TUTTI which allows us to view the in + l)-st jet space as a subspace 

of the first jet space of the n-th jet space. The strategy is to regard 

as a vector field and then use the case n = 1 above to prolong it 

to Then we restrict the resulting vector field to the subspace 

of and this will give the n-th prolongation of v. The fact that this is 

possible and gives a well-defined result will follow from the explicit formula below. 

The coordinates on are given by Mj^. = where J = (ji,... , 

1 < A: < p, and 1 < a < g. By (Id.l0.dll . the only new coefficients we need to 
calculate are those of the highest order derivatives By (ld.l().2dL they are 

given by 

= (3.10.24) 

i=l 

To finish the proof, it suffices to show that the explicit formula (13.10.191) solves the 
recursion relation (13.10.2411 in closed form. This follows by induction: supposing 
that (j)'^ has already been shown to satisfy (I3.10.19|) , it follows from (13.10.2411 that 


iJ,k ^ 


= Dk 




2=1 


2=1 




J,i 


2=1 


2=1 


= DkDj Ua - ^ e< + • ui, + eu%,) - DkC • u% 

\ 2=1 / 2=1 

= DkDj +^eui,k, 


where Mjjfe = d^u'^/dx'^dx^. Thus is of the form (13.10.191) as well, which 
completes the induction step. □ 
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3.10.13 Remark. Note that (I3.10.24p provides a useful way of calculating prolon¬ 
gations of vector fields recursively. 


3.10.14 Example. As an illustration of the relative ease with which one may 
calculate prolongations of vector fields using the general prolongation formula, we 
once more return to the rotation group G = SO{2) acting on cf. 13.10.71 The 
infinitesimal generator is u = —udx + xd^ so (j) = x and ^ = —u. Bv 13.10.1^ we 
have 

pr(i)u = v + 

where 

(j) = D T ^'^XX — I^x{x ~\~ Wlix) = 1 “t“ 

which confirms (|3.10.4|1 . To calculate the coefficient (f>^^ of in pr^^^^u, we can 
either use (I3.in.l9|) : 

(f) = D^{(f) ^Ux) T ^Uxxx — ^x(.^ ffi UUx) 'U'Xxxx — ^'^x^xxj 
or the recursion formula (13.10.241) : 

(j) = Dx4^ Uxx^x^ — 4 ” '^x) 4 ~ ’^x^xx — ^UxUxx- 

Consequently, 

pr^^^u = -udx + xdu + (1 + ul)du^ + iUxUxxdu^^- 

Note that to derive this result via the second prolongation of the group action (as 
in 13.8.211 would be much more involved. 

Based on this, and using the invariance criterion in 13.10.61 it follows that the ODE 
Uxx = 0 has iS'0(2) as a symmetry group. In fact, 

W^'^'’v{uxx) = iuxUxx = 0 whenever Uxx = 0. 


Since the solutions of Uxx = 0 are linear functions, this is basically just the statement 
that rotations take straight lines to straight lines. 

Next, consider the function 


k{x, 


^XX 

(1+^4)^’ 


(3.10.25) 


describing the Frenet-curvature of a planar curve. Then one easily checks that 
pr(^^u(«:) = 0, so 13.2.71 shows that k is an invariant of pr(^^S'0(2). Geometrically, 
this means that the curvature of a planar curve is invariant under rotations. 


3.10.15 Theorem. Let v, w be smooth vector fields on the open subset M of X x U. 
Then 

(i) pr(")(cu + w) = c ■ pr*^”)u + pr^^^w (c G R). 

(ii) pr("^[u,w] = [pr^^^u, pr(")w]. 

Proof, (i) This is immediate from l3.10.12l 

(ii) This also could be verified using 100.121 but we take a different route. First, 
bv 13. 8. II we have 

pr(")(g./r) = (pr(")g)-(pr(")/i). 

Also, defining {g + h) ■ x := g ■ x + h ■ x we obtain 

pr(")(5 + /i) = pr(’")5 + pr(”)/r 
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directly from the definition (13.8.21) (using sums of representing functions to represent 
sums of points in the jet bundle). Also, pr^"ddM = idM(") j and v !->■ pr^”)^ is clearly 
continuous. Finally, recall from (I3.10.1|) that 

pr(")(Fi«) 


so we can use 12.1.201 to calculate as follows: 

Fip^';:^“ o Fip--';:^" o o fp;:!"’" - idM(n) 

[pr(”)n,pr(”)w;] = lim ^ - 

E^O £ 

pr(») [Fr ^ o Ff^ o Fi:;, o FF^ - idM] 

e—^0 £ 


= pr 
= pr''"' [w, w 


lim 
€—^0 


Fr^oFr^oFi:;),oFr^-idM 


□ 


If we call a vector field v an infinitesimal symmetry of a system S of differential 
equations if it satisfies (I3.10.5L then we obtain from the previous theorem: 


3.10.16 Corollary. Let S be a system of differential equations of maximal rank 
defined on M C X x U. Then the set of all infinitesimal symmetries of S forms a 
Lie algebra of vector fields on M. 


To conclude this section we note the following equivalent way of writing the general 
prolongation formula f3.10.12l For v a vector held on M C A x {7, let 

p 

Qa{x,u^^'>) := (l)a{x,u)-'^C{x,u)uf, a = l,...,q. (3.10.26) 

The g-tuple Q{x, uF)) = ^ Q^) is called the characteristic of the vector held 

V. Using this, (13.10.191) becomes 


= DjQ^ + ^Cu% 




Substituting this into f|3.10.18p . a brief calculation gives 

= E E + 1 f ‘ f + E E “3,.^., ■ 


a=l J 

which, by (I3.10.15L means that 


duf ^ \ dx'^ 

1—1 \ a—1 J 


pr(”)w = pr*^”)uQ + ^ C A, 

i=l 


P 


where 


(3.10.27) 


(3.10.28) 


(3.10.29) 


VQ := 


:= ^Qa(x,wA)_^ —. (3.10.30) 


a=l J 


du°, 


Here, the last equality comes from applying (13.10.2811 to vq. In all of the above 
equations, the sums over J extend over all J with 0 < JJ < n. Note that the 
individual terms on the right hand side of (13.10.291) actually involve (n + l)-st order 
derivatives of u, and only their combination gives a genuine vector held on 
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3.11 Calculation of symmetry groups 


In this section we want to illustrate how to explicitly calculate the symmetry group 
of any given system S of differential equations. From the previous sections, we 
obtain the following general strategy for tackling this problem: 

(i) Make an ansatz for an infinitesimal generator u of a prospective one-parameter 
symmetry group of S in the form (13.10.171) with unknown functions and 

(ii) Calculate pr^"^?; according to (j3. 10. 1811 . (I3.in.l9|) . where n is the order of S. 

(iii) Insert the resulting expression into (13.10.51) . Since these equations only have 
to hold on iSp, eliminate any dependencies of the derivatives of the u’s by 
using the equations in S. The equations achieved in this way have to hold 
identically in x, u and the remaining partial derivatives of the u’s. 

(iv) Solving the resulting so-called defining equations produces a number of (usu¬ 
ally elementary) partial differential equations for the unknown functions 
and (j)a- 

(v) Compute the and fia from these equations. By 13.10.161 the vector fields 
gathered from this procedure form a Lie algebra of infinitesimal symmetries. 

(vi) The corresponding one-parameter symmetry groups are found by calculating 
the flows of the respective infinitesimal symmetries. 

Let us systematically work through this list in a concrete example: 


3.11.1 Example. Consider the one-dimensional heat equation 

Ut — Uxx (3.11.1) 


Thus we have X = with variables x, t, and U = R, and the equation P{x, t, = 
Ut — Uxx determines a subvariety in X x For the infinitesimal generators of 

symmetries of 5 = {P = 0} we make the ansatz 


d d d 

V = i{x,t,u)— + T{x,t,u)— + (j){x,t,u) 


'dt 


du 


(3.11.2) 


Our aim is to determine all possible coefficient functions r, (j) such that FI" is a 
symmetry group of the heat equation (13.11.11) . Since we wish to apply [O0.6l we 
will need the second prolongation of v, 


pr^^^ti = t) -I- , 


du. 


+ . 


dut 


dux 


dUr 


dut 


Inserting this into (I3.10.5|) , it follows that the only restriction on the coefficients of 
V is that 

(jfi = (jf”” whenever ut = Uxx- (3.11.3) 

Using (I3.10.19|) . we calculate 


4>* = Dfifi - ^Ux - TUt) -I- iUxt + TUtt = Dt4> - UxDt^ - UtDtT 
= fit- ^tUx + ifiu - Tt)ut - ^uUxUt - Tuul 


(3.11.4) 
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and 


^xx ^ 

XXX “ 1 “ '^'^xxt 

~ ^x^ ^x^x^ ‘^tDj.T ‘ZUxxDx^ ‘^Uxt^xT 

— 4*xx “t“ (^‘^4^xu ^xx')'^x '^xx'^t ~\~ i^uu ‘^^xu'jUx ‘^'^xu^xUt (3.11.5) 

^uu'^x '^uu'^^x^i {4^u *^^x^Uxx ‘^Tx^xt ^^uUxUxx 

'^u'Ut'Uxx ‘^'^uUxUxt- 

If we insert these expressions into (13.11.31) and replace Ut by Uxx wherever it occurs, 
we obtain an equation on the jet space X x that has to hold identically in 
all the remaining variables. We may therefore ‘split’ the equation by equating the 
coefficient functions of all monomials appearing in it. This gives the following table: 


Monomial 

Coefficient 



0 

II 

1 

to 

(a) 

'^xt 

0 

II 

1 

to 

(b) 

^xx 

— '^U 

(c) 

uluxx 

0 — '^uu 

(d) 

^x^xx 


(e) 

^xx 

4^u '^t — '^xx H“ 4^u ^^x 

(f) 

nl 

0 — ^uu 

(g) 

ul 

0 — 4 ^uu ^^xu 

(h) 

Ux 

- ‘^^xu ^xx 

(j) 

1 

— 4 ^xx 

(k) 


We first note that (a), (b) imply that r is a function of t only: r = T{t). Using this, 
(e) shows that ^ does not depend on u, and (f) gives n = 2^x- Therefore, 

= ^Ttx + a{t), 

where a is some function of t. (h) shows that (j) is linear in u, so 

t, u) = Pix, t)u + a{x, t) 

for some functions a, /3. By (j), —2Px = = ^xttx + cr*, so 

P{x, t) = -^Tttx^ - l-atx + p{t). (3.11.6) 

o Z 

Finally, (k) implies that both a and /3 must be solutions of the heat equation: 

Pt — Pxx- 

Combined with (13.11.611 we get 

Xttt — 0; — 0; Pt — 

Hence r is quadratic in t, a is linear in t, so for suitable constants Ci we get: 

r = C 2 + 2 c 4 t + 
cr = Cl + 2c^t 

1 

Pt = = -2c6 ^ P = -2cet + C3 

^ P = -\tuX^ - l-atX + p = -cqx'^ - C 5 X - 2 cet + C 3 

o Z 
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Consequently, 


^ = -Tt + cr = Cl + C4X + 2c^t + AcQxt 
T = C 2 + 2 c 4 t + 4 :Cet^ 

(p = (iu + a = {c3 — C5X — 2 cet — cex'^)u + a{x, t), 

where ci,..., cg are arbitrary constants and a is any solution of the heat equation. 
It follows that the Lie algebra of infinitesimal symmetries of the heat equation is 
spanned by the six vector fields 

vi = dx 
V 2 = dt 
V 3 = udu 
V4 = xdx + 2tdt 
V 3 = 2 tdx — xudu 
vq = Atxdx + 4:t^dt — {x^ + 2t)udu 
and the infinite-dimensional subalgebra 


Vc = a{x,t)du, 

where a is any solution of the heat equation. By 13.10.161 we know that these 
generators span a Lie algebra of vector fields. Thus with any two generators also 
their Lie bracket is an infinitesimal symmetry of the heat equation. 

The one-parameter symmetry group actions Fl^* corresponding to the generators 
can be calculated by solving the ODE 

■^{x{e), t{e), u{e)) = Vi{x{e), t{e), u(e)) 
de 

with initial value {x{0),t{0),u{0)) = {x,t,u). This gives the groups 


Gi 

G 2 

G 3 

G 4 

Gs 


G, 


(x + e, t, u) 

(x, t + e,u) 

(x, t, e^u) 

(e^a;, u) 

{x -\- 2st, t, *) 

X t 


6 • 


1 — 4et ’ 1 — 4et 
Ga '■ {x,t,u + ea{x,t)) 


, u\/l — 4£fei'-‘'* 


Using (13.5.31) we conclude that, given any solution / of the heat equation, so are 
the following functions (given by gG') ■ /, for € Gi): 


uG) = f(x-e,t) 
uG) = f[x,t-e) 
y(3) ^ e^f{x,t) 
uG) = f{e-^x,e-^H) 

J5) = e-^^+^'‘f(x- 2 et,t) 


11 ( 6 ) — - pl+4etj 

VI + 4st 

y(a) _ f(^x,t) +ea{x,t). 


X ^ \ 

1 + 4et ’ 1 + Aet J 
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Here, £ is a real number and a is any solution of the heat equation. G 3 and Ga 
are simply an expression of the linearity of the heat equation. Gi and G 2 reflect 
the space- and time-invariance of the equation. Gi is a scaling symmetry, and Gs 
represents a kind of Galilean boost to a moving coordinate frame. Gq is a genuinely 
local transformation group. Applying Ge to a trivial solution u = c {c a constant) 
we obtain that also 

C — 

U = - p.l + 4et 

\/l “h 4i6t 

is a solution. Setting c = we obtain the fundamental solution to the heat 

equation at the point {xo,to) = (0,—l/(4e)). Translating in t (using G 2 ) we arrive 
at the fundamental solution at (0, 0), 

1 ^ 
u = , e ‘‘t . 

y/Ant 

The most general one-parameter group of symmetries has a generator of the form 
ciVi+- —h ceVe + Va ■ Also, an arbitrary element of the symmetry group of the heat 
equation (if sufficiently close to the identity element) can be expressed in the form 


9 


= FI 


Va 

Sa 


o o ■. 
^6 


oFl"), 


and so the most general solution obtainable from any given solution u by such a 
transformation is of the form 


u = 


y/l + dSgt 


e^x+EQX'^ —e^t 
l+4e6t 


e ^*{x— 2 e 5 t) e , , , 

“ ^ - £2 + a{x, t), 


1 -f 4egt 


1 + Asgt 


with £ 1 ,..., Eg real constants and a any solution of the heat equation. 


3.12 Nondegeneracy conditions 

The central result IVlO.bl onlv gives a sufficient condition for a local group of trans¬ 
formations to be a symmetry group of a given system of differential equations. In 
the present section we will see that under some mild condition on the system the 
infinitesimal criterion from 1X1 0.61 is in fact also sufficient. 

Let us first look at the difference between the criterion 13.2.131 for the invariance 
of a system of algebraic equations, which is necessary and sufficient, and 13.10.61 
For a system of algebraic equations F = 0, for each point xq on the subvariety 
Sf = {x I F(x) = 0} there trivially exists a solution to the system, namely xq 
itself. In the case of a system P{x,u^^'>) = 0 of differential equations, however, 
if (xo,Uq"^) € Sp = {(x,u^’^^) I F(x,u^"^) = 0} there is no guarantee that there 
exists a solution u = f{x) of the differential equation locally around xq such that 
= pA'^'>f{xo). Our definition 13.5.31 of a symmetry group of P{x,u^^^) = 0 
requires pr^^^G to transform solutions into solutions, so we only can infer that it 
will move points in iSp for which there exists a solution as above to other points in 
Sp. In general, these points need not constitute the entire set Sp. Thus we define: 


3.12.1 Definition. A system P{x,u^^^) = 0 of n-th order differential equations is 
called locally solvable at the point 

(xo,ul;^) eSp = {(x,u(’^^) I P(x,u(")) = 0} 


if there exists a smooth function u = f{x) defined nearxo such thatu'^^ = f{xo). 
The system is called locally solvable if it is locally solvable at every point of Sp. It 
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is called nondegenerate if it is locally solvable and of maximal rank at every point 
e Sp. 

3.12.2 Remark. Consider a non-singular ODE of order n, i.e., of the form 

Uji — P{x^ w, Ux-, • ■ • 5 'an—if (3.12.1) 

with Uk = for any k and let (a;°, ..., ) S 5p. Then we need to find a local 

smooth function u = f{x) such that 

u° = fix°), = ul = f^-\x°). 

Here, the first n conditions are the usual initial conditions for the ODE (I3.12.ip . 
hence can always be satisfied by standard ODE solution theory. The last condition 
then simply follows from (j3.12.1ll itself (i.e., from the fact that (a;°, ..., ) G 

Sp). We conclude that non-singular (systems of) ODEs are always locally solvable, 
hence are in fact nondegenerate. 

Note that even for PDEs, local solvability is usually a very mild restriction - e.g., 
in Cauchy problems one usually is looking for solutions with prescribed values on 
an entire hypersurface. 

3.12.3 Example, (i) The 2-dimensional wave equation 

P{x, t, = utt -'axx = 0 
is locally solvable. In fact, let 

{x°,t°-u°-ul,u°;ul^,ult,u°t) G Sp, 

i.e., = vffx- Then we need to find a solution u = f{x) of the wave equation near 

(a;°,t°) with f{x°,t°) = (u°, m?; u?t). Since < = we can take for 

/ the polynomial solution 

f{x,t) = u°+u°{x - x°) +ui{t-t°) + ]^vfix[{x - x°)^ + {t- t°)^] 

+ ultix - x°){t - t°). 

(ii) The over-determined system 

'Ux = ya, Uy = 0 (3.12.2) 

is not locally solvable: given (a;o,j/o)) let 

■u° = M(a:o,yo) = 1, yfi = Ux{.Xo,yo) =yo, vPy = Uy{xo,yo) = Q. 

Then there is no local solution with the prescribed jet at (xo,yo). In fact, by 
cross-differentiation we obtain 

0 = Uxy = {yu)y = yuy + u = u, 

which is incompatible with the above prescription. 

(iii) Apart from integrability conditions as in (ii), the second main reason why a 
PDE may fail to be locally solvable is that there are no smooth solutions, even 
locally. The most famous example is due to H. Lewy ([la), who showed that there 
exist smooth functions h = h{x, y, z) such that the first order system 

Ux-Vy + 2yuz + 2xvz = h{x, y, z) 

Uy + Vx- 2xuz + 2yvz = 0 

has no smooth solutions even locally around any point. 

(iv) For analytic systems of PDEs (in Kovalevskaya-form), the Cauchy-Kovalevskaya 
theorem ensures local solvability. 
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For nondegenerate systems of differential equations we can now show that indeed 
(Id. 10. 511 is necessary and sufficient: 

3.12.4 Theorem. Let 


be a nondegenerate system of differential equations defined on an open subset M of 
X xU. Let G be a local transformation group acting on M. Then G is a symmetry 
group of the system if and only if 

y("))] = 0, i>=l,...,l, whenever P{x,u^^^) = 0 (3.12.3) 

for every infinitesimal generator v of G. 


Proof. The condition is sufficient bv l3.1Q.(Tl To see that it is also necessary, note 
that bv 13.2.131 condition (13.12.311 is equivalent to pr^^^G mapping Sp = {{x, | 

P(x, = 0} into itself. Thus we have to verify that any symmetry group G of 
the system has this property. Let (xq, Uq"^) G Sp. Then by local solvability we may 
find a smooth function u = f(x) defined near xq such that = pr("^/(xo). If 
g G G is such that pr^^^g • {xo,ul^'^) is defined, then by shrinking the domain of / 
if necessary, we can obtain that f = g ■ f is a well-defined function near xq, where 
(xqjWo) = g • {xq,uo). Then by (13. 8. 211 we have 


pr^”^ 5 - (xo,4”^) = (xo,pr^"^(5 •/)(xo)) =: (xo,m(,"0 . 
Since f = g ■ f is a solution, this shows that (xq, Uo"^) G Sp, as claimed. 




□ 


3.13 Integration of ordinary differential equations 

As an illustration of the power of the methods developed in this chapter, in the 
present section we investigate the general problem of solving ordinary differential 
equations (ODEs). It turns out that all the standard solution techniques for special 
types of ODEs are instances of symmetry methods. Moreover, knowledge of a 
sufficiently large group of symmetries of any given ODE allows to reduce the solution 
procedure to successive integrations (or quadratures, in classical terminology). 

Consider a first-order ODE 

du 

^=F{x,u). (3.13.1) 

We will show that if a one-parameter symmetry group of (13.13.111 is known, then it 
can be solved by integration. Let G be a local one-parameter group of transforma¬ 
tions acting on an open subset M oi X x 17 = , with infinitesimal generator 

V = ^{x,u)^ + fi{x,u)-^. (3.13.2) 

By 13.10.121 its first prolongation is given by 

pr(i)x = ^{x, u)-^ + (j){x, u)-^ + (jf[x, u, (3.13.3) 

OX OU OUx 

where 

(j) = Dx^f^ UxDx^ = 4>x ^x)ux ^u'^x' (3.13.4) 
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The infinitesimal criterion (13.10.51) for v to generate a symmetry of (I3.13.ip then 
reads 


dx \du dx) du ^ dx ^ du' 


(3.13.5) 


for all {x,u). Thus any solution to the PDE (I3.13.5|) generates a symmetry of the 
ODE (13.13.11) . Although at first sight, replacing the ODE (13.13.11) by the PDE 
(13.13.51) may not seem to facilitate the problem, in practice one can often find 
symmetries directly, e.g. by geometric considerations. 

Suppose that v generates a symmetry of (13.13.11) and let v\(^xo,uo) 0- Then by 
straightening out (see [TTJ 17.14]) we may pick new coordinates 


y = T]{x,u), w = C{x,u) 


(3.13.6) 


near (xo,uo) such that in the new coordinates we have v = d^, and thereby 


( 1 ) ^ 
pr*- >v = V = 

aw 

The condition that (13.13.11) be invariant under v then simply means that it must 
be independent of w, i.e., it can be written in the form 


dw 

dy 


H{y) 


for some smooth function H. This equation can then be solved trivially by integra¬ 
tion: 


w = 


H{y)dy+ const. 


Transforming back to the (x, u)-coordinates gives the solution to (|3.13.1I1 . 

To find the required coordinate transform, note that in the new coordinates we 
require (cf. [H 2.5.3]) that 


V = 


, . 9 d 


A 

dw' 


so in terms of (x, u), we need to satisfy 




ou 


(3.13.7) 


Here, the first equation means that rj is an invariant of G, hence can be found by 
the method of characteristics indicated in (13.3.31) . Once r] is known, ^ can often be 
found by inspection. In any case, also the second equation can be solved by the 
method of characteristics. As indicated above, there is no guarantee that solving 
(13.13.71) is indeed easier than solving the original equation (13.13.11) . For example, if 


^f^=F{x,u) (3.13.8) 

§(x,m) 

then inserting (j) = ^ ■ F in ()3. 13.51) gives an identity, i.e., finding a symmetry in this 
case is exactly the same problem as solving the original equation. 


3.13.1 Example. Consider the homogeneous equation 
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This equation is invariant under the group of scaling transformations 


G : {x, u) H> {Xx, Xu), A > 0. 


Indeed, by (ld.ld.4L the first prolongation of the generator v = xdx + udu is v 
itself and so one readily checks that (ld.ld.5|) is satisfied. New coordinates satisfying 
(13.13.711 are given by 


u 


w = logcc. 


X 


Then 


du du/dy x{l + ywy) 

dx dx/dy xwy 

so the equation in the new coordinates becomes 


1 + ywy 

Wy 


dw 1 

dy F{y)-y' 

which can be solved by integration: 


w = 


dy 

Fiy) - y 


+ c. 


3.13.2 Example. Consider the equation P dx + Q du = 0. We show that if 
it possesses a one-parameter symmetry with generator v = ^dx + (j>du, then the 
function ^ 

^ ^ ^{x,u)P{x,u) + (j){x,u)Q{x,u) 

is an integrating factor. 

To see this, note first that the given ODE is of the form (13.13.11) with F = —PjQ. 
Inserting this into (13.13.51) and recalling 13.12.41 and 13.12.21 it follows that u is a 
symmetry if and only if 



dP 

du 


Q- 




OX 




PQ - = 0. 

au 


(3.13.9) 


On the other hand, that R is an integrating factor means that 




and inserting R from above here gives 



which is equivalent to (I3.13.9p . as claimed. 


Consider now a non-singular ODE of order n: 

P{x, = P{x, U,Ux, - ■ ■, Un), (3.13.10) 

where Un = ■ We will show that knowledge of a one-parameter symmetry group 

G of (13.13.101) with generator v allows to reduce the order of the equation by one. 
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To see this, choose coordinates y = r]{x, u), w = ({x, u) as in (13.13.71) that straighten 
out V, so that V = dw Then by the chain rule, we can re-write the derivatives of u 
with respect to x in terms of y, w and the derivatives of w with respect to y. 


d^u f dw d^w\ 

for suitable functions 6 k- Substituting these expressions back into (13.13.1011 . we 
obtain the equivalent n-th order equation 

= P{'y,w,Wy,...,Wn) = 0 (3.13.11) 


in the new coordinates. Now note that since (13.13.101) has G as a symmetry group, 
the same is true of the transformed system (13.13.1111 : this follows from the definition 
of a symmetry group and the fact that the prolongation of a group action according 
to (13. 8. 211 is equivariant under changing variables (again by the chain rule). In the 
new variables, pr^"^r; = v = dw, and so the infinitesimal criterion (I3.10.5p reduces 
to 

dP 

pr(")^(p) = __ = 0 whenever P{y,w^'^^) = 0. 

Since P is invariant under G^"^ bv l3.3.8l and l3.12.4l ('and provided that G acts semi- 
regularly with orbits of constant dimension) there exists an equivalent equation 
that depends exclusively on a complete set of functionally independent invariants 
of G*-"^. In the present situation, such a set of invariants is obviously given by 
y, Wy,Wyy,... ,Wn- Cousequcntly, there is an equivalent equation 

P{y,Wy,...,Wn) =0, 


which is independent of w. Thus, setting z = Wy we obtain an equivalent ODE of 
order n — 1, as claimed: 

- jn —1 , 

Given any solution z = h{y) of this ODE, w = f h{y) dy+c is a solution to (13. 13. lip , 
and transforming back to {x, u)-coordinates gives a solution to (|3. 13. 1011 . 


3.13.3 Example. Any homogeneous linear ODE of second order 

Uxx + pix)ux + qix)u = 0 (3.13.12) 

is invariant under the scaling group G : {x,u) {x,Xu). In fact, the generator of 

G is w = udu, so pr^^^i; = udu + Uxdu,^ -I- Uxxdu^^x^ so the infinitesimal criterion 
(13. 10. 511 is satisfied. Coordinates satisfying (13. 13. 711 are given by 

y = X, w = logM, 

so V = dw- To re-express the equation in the new coordinates, note that 

U — e , Ux — rCajC , Uxx — (y'^xx P U^j;)c , 


giving 


Wxx + wl+ p{x)wx -I- q{x) = 0, 


which, as expected, is independent of w- Setting z = Wx = Ux/u we therefore obtain 
the well-known transformation of (13.13.121) into the Riccati equation 


Zx = -Z^ - p{x)z - q{x)- 
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3.14 Differential invariants 


In this section we look at a converse to the problem of determining the symme¬ 
try group of a differential equation, namely: Given a local transformation group, 
what is the most general type of differential equation that admits it as a symmetry 
group? We will give an answer to this question in the setting of ordinary differential 
equations. 

Recall from l3.12.4l that a nondegenerate differential equation = 0 admits 

a local transformation group as a symmetry group if and only if the corresponding 
subvariety Sp of is invariant under pr*^”)G. Provided that pr^^^^G acts semi- 
regularly with orbits of constant dimension, 13.3.81 then implies that there is an 
equivalent equation P = 0 describing the subvariety Sp and depending only on 
the functionally independent invariants of pr^^^G. We therefore give a name to the 
invariants of pr*^")G: 

3.14.1 Definition. Let G be a local transformation group acting on M Q X x U 

and let n > 1. An n-th order differential invariant of G is a smooth function 
r] : —>■ R such that r] is an invariant o/pr*-"^G.' 

r?(pr^”^(; • (x, = t ]( x , 

for all (x, G and g £ G such that pr(")g • (x, is defined. 

3.14.2 Example. For the rotation group G = SO(2) on X x U = we have 
V = —udx + xdu, and by (I3.10.4|) we have 

pr^^^u = -udx + xdu + (1 + ul)dua, 

By definition, the first order differential invariants of G are the invariants of pr*^^^G, 
which can be calculated by the method of characteristics, cf. (13.3.311 . This gives the 
following complete set of first order invariants of G: 

—y xux - u 

y=\/x^+u^, w = -. 

X -I- UUx 

Any other first order invariant must be a function of these fbv 13.3.71) . 

Our next aim is to show that one can always calculate higher order differential 
invariants inductively from known ones of lower order. To show this we need an 
auxiliary result first. 

3.14.3 Lemma. Let v = f^dx -I- (fdu be a vector field on M C X x U = and let 

C = be smooth. Then 

pr(”+i)u(D,C) = D,[pr(")u(C)] - Dxf ■ Dxf. (3.14.1) 

Proof. By (13.10.2911 . 

pr("+i)u(D,C) = pr(”+i)uQ(D,C) 

and 

D,[pr(")u(C)] = D,[pr(")uQ(C)] + DxifDxC). 

It therefore remains to show that 

pr("+i)uQ(D,C) = Dx[pr(”)^^Q(C)]. 
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Using (I3.10.30|) . we calculate: 



where in the penultimate line we used that ( = ((x, hence its derivative with 

respect to Un+i vanishes. □ 


3.14.4 Proposition. Let G be a local one-parameter group of transformations 
acting on M <Z X x U = Let y = and w = ((x,u^'^^) be n-th order 

differential invariants of G (n> 1). Then the derivative 


dw dw/dx _ DxC 
dy dy/dx 

is a differential invariant of G of order n + 1. 


(3.14.2) 


Proof. Note first that for any vector field Z and any smooth functions /, g we have 
the usual quotient rule Z{f fg) = g~'^{Z{f)g — f Z{g)), as follows from the derivation 
description of vector fields. Using this and (I3.14.1L we obtain from (13.14.21) : 


pr 


("+i)„ I ^ 

dy 


1 


(D.vr 

1 


(pr('‘+i)u(Zl,C) • ■ pr("+i)u(Zl,ry)) 

(Ha;[pr(”)u(C)] • Da:r] - 


- D,C ■ + D,C ■ ■ D^p) = 0 

since C and p are n-th order invariants, so pr("^u(^) = pr("^(? 7 ) =0. □ 


3.14.5 Corollary. Let G be a local one-parameter group of transformations acting 
on M C X X U = K.^. Let {y = p{x,u), w = w,Ua,)} be a complete set of 
functionally independent invariants o/pr^^^G. Then (locally around any point) the 
derivatives 


y,w 


dw d” 

^ dy' ' dy'^~^ 


form a complete set of functionally independent invariants for pA'^'^G for n > 1. 


Proof. Successively applving l3.14.4l it follows that the derivatives up to order k are 
invariants of pr^^^G, hence also of any higher prolongation of G (by (13.8.41) 1. So it 
only remains to show that they are functionally independent. But this is immediate 

ifc 

because depends explicitly on Uk+i, hence is independent of the previously 
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constructed invariants y,w, ..., since those only depend on x,u,... ,Uk- 

Bv l3.3.7l since G has one-dimensional orbits and dimX x = n+2, completeness 
follows. □ 


3.14.6 Example. Applying the previous result to the first order invariants of the 
rotation group given in 13.14. l?l it follows that y, ru, and 


dw dw/dx 
dy dy/dx 


\/ x"^ + 

(x + UUxY 


[{x^ + V?)Uxx 


(1 -I- ul){xUx - u)] 


together form a complete set of functionally independent invariants for pr^^^^G. Thus 
any other second order differential invariant can be written as a smooth function 
of these invariants. For example, the curvature n from (I3.10.25p is an invariant of 
pr^^^G (as one checks by verifying that pr(^)u(K) = 0), and in fact 


Wy 


w 


K = 


{1 + (1 -I-?ii2)3/2 j/(l-P 


The important point to note now is that once the differential invariants of a local 
group of transformations GonMCXxU = M.'^ are known, we can determine all 
ODEs that admit G as a symmetry group, i.e., all ODEs that can be integrated 
using G: 


3.14.7 Theorem. Let G be a connected local transformation group acting on M G 
X X U = 'Mf and let rf-{x,u^^'^),... ,r]^{x,u^'^^) be a complete set of functionally 
independent n-th order differential invariants of G. 

(i) IfpA'^^G acts semi-regularly on then an n-th order nondegenerate dif¬ 

ferential eguation P{x,u^^^) = 0 admits G as a symmetry group if and only 
if, locally around any point in Sp = {{x,u^'^'>) \ P{x,u (")) = 0} C M("), there 
is an eguivalent equation 

involving only the differential invariants of G. 

(a) If G is a one-parameter group of transformations, any nondegenerate n-th 
order differential equation having G as a symmetry group is equivalent to an 
(n — l)-st order equation 


P(y,w, 


dw 


dy'"'' dy' 


n — 1 


= 0 , 


where y = r](x,u), w = (f{x,u,Ux) form a complete set of functionally inde¬ 
pendent invariants o/pr^^^G. 


Proof, (i) Clearly if there is an equivalent equation depending only on invariants 
then Sp itself is locally invariant. Since P is non-degenerate, 5 is a closed regular 
submanifold of , so the proof of l3.2.13l shows that this implies that S is invariant 
under pr^"^G, which in turn yields that G is a symmetry group of P. Conversely, 
if G is a symmetry group then the proof of 13.12.41 shows that pr(")G leaves Sp 
invariant. The claim then follows from 13.3751 

(ii) This is immediate from (i) and 13.1431 □ 
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3.14.8 Example. Continuing our analysis of the rotation group from 13.14.61 we 
can use l3.14.71 to find the most general first and second order ODEs that admit the 
rotation group as a symmetry group. Bv 13.14.71 (ii), any such first order equation 
is equivalent to one involving only the first order invariants given in 13.14.21 If we 
solve for w it follows that any such equation is of the form 

XUx — u 
X + UUx 

Analogously, any second order equation invariant under rotations is equivalent to 
one only involving y, re, and, say, the curvature k = ; hence is of the 

general form 

Uxa^ = (1 + 

V a; + UUx ' 

We may also employ 13.14.71 to obtain an alternative way of reducing the order of a 
given differential equation = 0 once we know a one-parameter symmetry 

group. In fact, we know from 13.14.71 (ii) that the equation must be equivalent to 
one involving only y,w, ..., . But this latter equation automatically is of 

order n — 1. Thus simply by re-expressing the equation in terms of the differential 
invariants already reduces the order by 1. Furthermore, once the solution w = h(jj) 
of the reduced equation is known, the solution of the original equation is found by 
solving the auxiliary first order equation 

Cix, u, Ux) = h{r]{x, u)) (3.14.3) 

obtained by substituting for y and w their expression in terms of x and u. As 
(13.14.31) depends only on the invariants 77, ( of pr^^^G, it has G as a one-parameter 
symmetry group, hence can be integrated using the methods described above for 
first order equations. 

3.14.9 Example. The second order equation 

x^Uxx + XUx = (3.14.4) 

is invariant under the scaling group G : {x, u) 1 —>■ {Xx, Xu). In fact, the infinitesimal 
generator of G is n = ^ {Xx, Xu) = xdx + udu- Hence (13.10.191) gives 

pr^ V — xQx “t“ udu Uxx^uxx^ 


F{-\/x^ F u^). 


so 

pT^‘^\{x‘^Uxx + XUx ~ UUx) = ‘2x^Uxx + XU^ — UUx — x'^Uxx = 0 

for any {x,u,Ux,Uxx) G Sp. Bv 13.14.51 and lT13.ll a complete set of functionally 
independent invariants of second order is given by 

u dw dw/dx x'^Uxx 

X dy dy/dx xUx — u 

Inserting this into (13.14.41) gives the transformed equation 


/ \ dw 2 

{u’-y)^+w =yw, 
dy 


which, as expected, is of first order. This has two families of solutions: either w = y 
(which is a singular solution), or ^ = —w, which implies w = ce~^ . Re-substituting 
gives 


du u 
dx X ’ 


or 


du 

dx 
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This results in the one-parameter family of singular solutions u = kx, and the 
implicit solutions 


/ 


dy 

ce~y — y 


logx -I- k, 


with y = u/x, which constitutes the general solution to the original equation 

(13.14.411 . 


3.14.10 Example. We return to the two-dimensional heat equation ut = Uxx, 
whose symmetry group we calculated in l3.ll.ll and use the methods of this section 
to determine the general form of traveling wave solutions to the heat equation. In 
general, a traveling wave solution to a differential equation is one which is invariant 
under a translation group. Here we consider the symmetry of the heat equation 
given by the translation group 

{x,t,u) 1-^ {x + ce,t + e,u), £ G R, 

with generator v = dt + cdx, for c some fixed constant. Global invariants are given 
by 

y = X — ct, V = u. (3.14.5) 

Thus any group-invariant solution is of the form v = h{y), or it = h(x — ct), which 
is a function that doesn’t change its shape along the straight line x — ct = const. If 
we express the derivatives of u in terms of the new variables we obtain 


Ut - Ux - Vy^ Uxx - 

Inserting into the heat equation, we arrive at the ODE determining all traveling 
wave solutions of the above form: 


CVy — Vyy, 


with general solution 

v{y) = ke~'^^ + I 

(k, I arbitrary constants). If we now substitute back into the heat equation we 
obtain the general form of traveling wave solutions: 

u{x,t) = + 1. 

Much more can be said about group-invariant solutions of PDEs. In fact, there 
is an entire theory that allows to calculate solutions invariant under any subgroup 
of the full symmetry group of any given system of differential equations. While 
these methods also basically rest on choosing new variables from invariants of the 
given sub-group, the case of higher (than one) dimensional symmetry groups is 
significantly more involved than what we considered in this section. For a precise 
formulation, the theory of extended jet bundles and group invariant prolongation is 
required. We refer to [B Ch. 3] for an in-depth study. 
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Chapter 4 


Variational symmetries 


Conservation laws (like conservation of energy, momentum, etc.) play a central role 
in physics. In contemporary theoretical physics one usually formulates the governing 
principles of a theory in the form of a variational principle - roughly, this says that 
some relevant quantity is to take an optimal value. It is a deep and far-reaching 
discovery of Emmy Noether (1918) that for such systems, every conservation law 
comes from a corresponding symmetry property. In the present chapter we give an 
introduction to this field. 


4.1 Calculus of variations 

Let X = W, with coordinates (a;^,..., x^), and U = W with coordinates (u^,..., 
u’^). Also, let n be a connected open subset of X with smooth boundary cAI. By a 
variational problem we mean the problem of finding extrema (maxima, minima, or 
stationary points) of the functional 

C[u] = ( L{x^u^^\x)) dx 

Jq 

in some class of admissible functions u = fix) on fl. The integrand L : X x —>■ 

R is called the Lagrangian of the variational problem C. The class of admissible 
functions varies with the problem under consideration (e.g., through the imposition 
of various boundary or regularity conditions). We will confine ourselves here to 
studying smooth variational problems. The calculus of variations is a vast subject 
and we will here barely be able to scratch the surface, with a focus on symmetry 
methods. For a gentle introduction see, e.g., m, a rather comprehensive treatise 
is [5l[6]. 

4.1.1 Example, (i) To find the shortest curve u = f{x) connecting two points 
(a, b), (c, d) in the plane, one needs to minimize the length of u = fix): 

C[u\= f ^/l + dx. 

J a 

(ii) More generally, if (M, g) is a smooth Riemannian manifold and x, y £ M then 
the problem of finding the shortest curve from a; to y is a variational problem, 
namely that of minimizing the arc-length of curves u: [0,1] —> M connecting x and 

y- ^ 

C[u]= [ ^giu'it),u'it))dt. 

Jo 
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Solutions to this problem are called geodesics of {M,g). 


The basic approach to finding extremals of variational problems is very similar to 
that of extremizing smooth functions in basic real analysis. There, given a function 
/: n —>■ R, one notes that if x is an extremum of / then for any y, the one¬ 
dimensional function e i—>■ f{x + ey) must have an extremal. Therefore, we must 
have 


0 = 


de 


f{x + ey) = {'S/f{x),y) {y G 


and so the gradient V/ of / at x vanishing is a necessary condition for x being an 
extremum of /. 


For functionals /l[u], the role of the gradient is taken over by what is called the 
variational derivative. Moreover, the inner product on M" gets replaced by the 
L^-inner product of functions /, g: —>■ : 


if, 9 )= [ f{x)-g{x)dx= [ ^/“(x) 5 “(x)dx. 
JQ JQ 


q ;=1 


With these notations we can define: 


4.1.2 Definition. Let C,[u] he a variational problem. The variational derivative of 
C is the unique q-tuple 


5C[u] = (<5i£[m], ... ,(5q£[u]), 

5C : C°°{n) —>■ with the property that 

C[f + eri]= f dC[f]{x) ■ r](x) dx (4.1.1) 

0 

for every smooth function f on fl and any compactly supported smooth function rj G 
'DiTlY such that f + eg is itself admissible. 8aC is called the variational derivative 
of C with respect to u°‘. 

Here, by T>(H) we denote the space of test functions on H, 21(f2) = {/ g \ 

supp(/) (E H}. 

4.1.3 Proposition. If u = /(x) is an extremal of C[u], then 

5C[f]{x)=0 VxGH. (4.1.2) 


de 


Proof. For any g G 'D{TlY and for all £ small, f + eg is admissible, and by 
assumption, e 1 —> C[f + eg] must have an extremum at e = 0. Hence by classical 
real analysis, 


0 = 


de 


m + sv]= [ SC[f]{x) ■ g{x)dx. 
) Jci 


As this has to hold for any 77 , the claim follows. 


□ 


Using the notion of total derivative, cf. (13.10.151) , we now wish to derive an explicit 
formula for the variational derivative. First, 


de 


0 



L(x, pr^"^(/ -I- eg){x)) dx 
— (x,pr(")/(x)).ajr7“(x)) 


dx. 
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Since r] has compact support, we may apply the divergence theorem to this expres¬ 
sion, with all boundary terms vanishing. Note also that when a partial derivative 
dxj is applied to dLjdvPj then the result can be expressed by a total derivative, so 


d 

de 


£[f + er]] 

0 


^ 

pr("+#‘^V(a:))] ?7“(a;) dx. 

a—1 J 


(4.1.3) 


Here, for J = (ji,... ,jk) we set {—D)j := {—Dj^){—Dj^) ... This formula 

is the first instance of the following operator that will play a central role below. 


4.1.4 Definition. For 1 < a < q, the a-th Euler operator is defined by 

J J 

where the sum extends over all J = (ji,... ,jk) with 1 < jn < p, k > 0. Moreover, 
we set E{L) := {Ei{L ),..., Eq{L)). 

Although the above sum is formally infinite, whenever Ea is applied to some con¬ 
crete L{x,u^^^) only finitely many terms are required. 

In terms of the Euler operator, (14.1.31) says that 

d£ = E{L) 

Together with 14.l!^ this gives the classical necessary condition for extremals of a 
variational problem: 

4.1.5 Theorem. If u = f{x) is a smooth extremal of the variational problem 

£\u] = L(x,u^'^^) dx then it must be a solution of the Euler-Lagrange equations 

Ea{L) = 0, a = 

□ 


4.1.6 Example. Let p = q = 1, i.e., we consider a variational problem for a 
real-valued function of one variable. In this case, 


E = 

3=0 


d 


d 


dui du 


d 

dUr 


Dt 


d 


dux 


Thus the Euler-Lagrange equation for an n-th order variational problem 

rb 


£[u] = j L{x,u^^Ydx 

J a 


is given by (setting Uj = ^) 

0 = E{L) = ^-Dx^ + DI-^ -+ 

Oil Ollx Olixx Ollfi 

This is an ODE of order (at most) 2n. The most common case of a first order 
variational problem L = L{x, u, Ux) gives 

d^L 


0 = EiL) = ^ - Dx-^ 

au out. 


dL 


d^L d^L 

— u 


o '^XX o ■ 

ououx out 


du dxdux 

In particular, for the curve length problem 14 .1.1 1 the Euler-Lagrange equation reads 


-Dx 


[l + ulfD 


= 0 , 


V \/i + / 

i.e., Uxx = 0. As geometrically expected, the solutions are straight lines u = kx + d. 
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4.1.7 Example. The Dirichlet principle: Consider the variational problem of 
minimizing the total energy (kinetic plus potential) of some system in the form 


C,[u\ = J ^\\7u\^ — uhdx 

with some external potential h. Then the Euler-Lagrange equation reads 



i.e., we obtain the Poisson equation —Au = h. 

4.2 Variational symmetries 

In this section we want to develop a notion of symmetry that applies to variational 
problems, similar to the symmetry groups of differential equations studied in the 
previous chapter. Consider a variational problem 



(4.2.1) 


Let G be a local transformation group on an open subset MoiilQxUCXxU. 


If u = /(x) is a smooth function on a sufficiently small subdomain 11 C Hq such 


that the graph of / lies in M then any transformation g € G sufficiently close to 
the identity will transform / into another smooth function u = /(x) = {g ■ /)(x) 
defined on some H C fig- Therefore, we define: 

4.2.1 Definition. A local transformation group G acting on M Q fig x U is called 
a variational symmetry of the functional (14.2.111 if whenever is a subdomain with 
n C Hq, u = f{x) is a function defined over 11 with graph contained in M, and 
g € G is such that u = /(x) = {g ■ f)(x) is a well-defined function defined on 
n C Oq, then 



(4.2.2) 


Our first aim, in line with our previous considerations on symmetries of differential 
equations, is to derive an infinitesimal criterion for variational symmetries. For this, 
we need a few preparations. 

To begin with, we need to study how a variational problem transforms under the 
action of a local transformation group. Let 


x = ^{x,u), u = 4>(x,tt) 


(4.2.3) 


be any change of variables. Then there is an induced change of variables 


u(”) = $(”)(x,u(”)) 


for the derivatives, given by prolongation (i.e., by differentiating ()4.2.3p by the 
chain rule). If the conditions of the inverse function theorem are satisfied then 


(14.2.31) determines a transformed function u = fix). Thereby, any functional 
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is transformed into a new functional 

^[f] = / L(i,pr(")/(x))di. 

JQ 

Here, the new domain 17 = {a; = S(a;, f{x)) | a; G 17} depends both on 17 and on /. 
By the change of variables formula we get 

L(a:,pr(")/(x)) = f{x)) det f{x)), (4.2.4) 

where J is the Jacobian matrix with entries 

J*-^(a:,pr(^V(a;)) = -^B.\xJ{x)) = Dj'~\x,pY^^'> f{x)) i,j = (4.2.5) 

where, for simplicity, we assume that det J(x) > 0 , otherwise we have to add 
absolute value signs. 

4.2.2 Definition. Let x = {x^ ,. . . ,x^) and let P = P{x,u‘'^^) = {Pi{x,u^'^^),... , 
Pp (a;, «("■))) be a p-tuple of smooth functions. Then the total divergence of P is 

Div P = D\P\ + ■ ■ ■ + DpPp, 

where Dj denotes the total derivative with respect to x^. 

With these notations, we have: 

4.2.3 Lemma. Let v = £fdx' + Sa be a vector field on M C LIq x U 

and set := Fl^ = (S^^, $ 5 ^). Denoting by Jg^ the Jacobian (14.2.51) corresponding 
to , we have 

■^{detJg^{x,u^^'>)) = (Div^)(pr(^)ge • (a;,^^^^)) • det Jg^(x, (4.2.6) 

Proof. Recall first that for matrices A, B with A invertible, the derivative of the 
determinant function at A in the direction B is given by det'(H)(P) = det(H) • 
tr(H“^P). Consequently, for any fixed function u we get 

Adet Jg^ =det'(Jgj(^Jg^) =det Jg^ •tr(J-/^JgJ. (4.2.7) 

By (I4.2.5L Jg^{x,u^^^x)) = dx{^g^{x,u{x))) (with dx abbreviating the x-gradient, 
and similar for di, 82 below), so 

^(Jg,(x,u(^))(x)) = 9,,(^5g^(x,u(x))) = 5^(^(Sg^(x,u(x)),4>g^(x,u(x)))) 

= {diOi^ge (x, u(x)), $g^ (x, u{x)))dx{^g, (x, u(x))) 

+ id 2 f)iEg^ (x, u(x)), 4>g^ (x, uix)))dx ($g, (x, u(x))) 

= [{diOi- ■ •) + 92f{. ■■)■ dx{<^g, {x, u{x))) ■ dxi^g, (x, u{x)))-^]dx{Eg^ (x, u(x))) 

Now let u be the transformed function g^ ■ u, so that with x(x) = Sg^(x,ii(x)) we 
have u(x(x)) = d>g^(x,u(x)). Then 

dx(^gA^^'^(^))) = dx(uo£)(x) = d£u(x(x))-dxx(x) = d£u(x(x))-dx(Sg^(x,u(x))), 
so, together with the above calculations we obtain 

^(Jg,(x,u(^))(x)) = [(5iC)(x,u(x)) + (d 20 (x,u(x)) ■ dxu(x(x))] ■ Jg^(x,u(^^x)) 
= Dxf(pr'^^^ge(x,J^^(x))) ■ Jgjx,u^^^(x)). 
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Finally, using (14.2.71) . and the fact that tr{AB) = tr(i3A), we arrive at 

(x))) = det{Jg^{x,u^^\x))) ■ tr(Ds^(pr(^)ge(a;, ^(^^(x)))) 

= det(Jg^(x,'u(^^(x))) • (Div^)(pr(i)ge(x,M(^)(x))). 

□ 

The infinitesimal criterion for variational symmetries now is given by the following 
result. 


4.2.4 Theorem. Let G be a connected local transformation group acting on M Q 
LIq X U. Then G is a variational symmetry group of the variational problem (14.2.11) 
if and only if 

pr(”)w(L)+L-DivC = 0 (4.2.8) 

for all (x, and every infinitesimal generator 

i —1 a —1 

ofG. 


Proof. As usual, for g G G we write (x,m) = g ■ {x,u) = (Sg(x, m), $g(x, m)). By 
definition of a variational symmetry, see (I4.2.2L we get 


f L(x, pr^"^/(x)) dx = ( f{x)) dx 

Jn Jn 

[ L{x{x),pT^'^'>f{x{x)))detJgix,pT^^'>f{x))dx 
Jn 

Since this equality has to hold for all subdomains Ll and all functions /, the inte¬ 
grands must agree pointwise, so g is a variational symmetry if and only if 


L(pr(")5- (x,u(")))det Jg(x,w(i)) =L(x,n(”)) V(x,u(")) G 


(4.2.9) 


for all g such that both sides are defined. We now insert g = Pe = FI" and differen¬ 
tiate with respect to e, which by (14.2.61) and the product rule yields 

(pr(”)x(L) + L •Div^)|(p,(„)g^.(^y„))) • det Jg^(x,M(^)) = 0 (4.2.10) 

Setting e = 0, i.e., = id, this formula reduces to (I4.2.8L proving necessity. 

Conversely, if (14.2.81) is satisfied then also (14.2.101) holds for e sufficiently small. 
Therefore, the derivative of (14.2.91) (with g = Pe) with respect to e vanishes identi¬ 
cally. Integrating from 0 to e then shows that (14.2.91) holds for any g^. Since G is 
connected the result then follows since any g G G can be written as a product of 
such g^. □ 


4.2.5 Example. Returning to (14.1.11) (i), where C[u] = y^l -I- dx, we expect 
that rotations should be variational symmetries, as they leave curve lengths invari¬ 
ant. To verify this, let v = —udx + x9„ be a generator of the rotation group on 
Then by p.l0.4|) . 

pr(i)x = -udx + xdu + (1 + ul.)du^. 

In particular f = —u, and we calculate 

pr(Fv(L) + L . Dxf = (1 + + ul- ^/l + ul■Ux = 0, 

so indeed (|4.2.8I) is satisfied. 
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Next, we want to clarify the relationship between variational symmetries of some 
variational problem and symmetries of the corresponding Euler-Lagrange equations. 
To this end, we first need to understand how the Euler-Lagrange equations of 
variational problems transform under a change of variables. 

4.2.6 Proposition. Let L(x, L{x,vS'^ he two Lagrangians related by a 

change of variables given by (j4.2.3l) . 114. 2. 41) . i.e., 

X = S(a;, u), u = $(a;, u) 


and 

Then 


L{x, = L{x, det J(x, 


K. {L){x, F^pix, (Z)(i, a=l,...,q, (4.2.11) 


3=1 


where is the determinant of the following (p + 1) x (p + 1) -matrix: 

(DiF,^ ... 


F„h = det 


7TP ^ 

du3 


(4.2.12) 


... 


Proof. Let (xq, be any fixed point in X x / be a smooth func¬ 

tion defined near xq with (xo,Uo^"^) = (xq, pr*^^"^/(xo)). By the inverse function 
theorem, on an open ball LI around xq , / is transformed by our change of variables 
into a function u = /(x). If r/ € T>(Ll)'^ and e is sufficiently small, then the pertur¬ 
bations Ue = /(x,e) = f(x) -\-eri{x), required for calculating variational derivatives, 
are transformed into functions u = f{x,e), which are determined implicitly by 

i = 2(x,/(x)-b£p(x)) =: Se(x), u = $(x,/(x)-be??(x)) =: $e(x). (4.2.13) 

Again the inverse function theorem determines the domain of each /(. ,e). More 
precisely, by [TJ Suppl. 2.5 A], the minimal radius of a ball around Xg := S(x, /(xo) + 
er]{xo)) where f{-,s) is defined can be estimated from below in terms of the second 
derivatives of the maps Sg from (14.2.131) in a neighborhood of Xq as well as the 
norm of the inverse of their Jacobians at Xq. Since 77 is compactly supported, LI is 
compact, and x^ —>■ xq as £ —>■ 0 , by choosing e small we can therefore achieve that 
all these balls contain a fixed neighborhood of xq, which we then take as a common 
domain LI of all /(. ,e), independently of e. In fact, since the radius of the ball LI 
around xq is itself determined by the mentioned estimates (with e = 0 ), we may in 
addition achieve that supp(? 7 ) is contained in 5“^ ( 11 ) for e small. 

Now set fj{x,e) := f{x,e) — f{x) and note that E!e(x) = So(x) as well as <l>e(x) = 
$o(x), and thereby /(^^(x),^) = /(E!o(x)) for x ^ supp( 77 ). This implies that 
fj{y,e) = 0 for y ^ E!e(supp(? 7 )). Again due to the compactness of supp(? 7 ) this 
shows that, for e sufficiently small, all fj{. ,e) have compact support contained in 
12. The same is therefore true of „ = ^| 

ae le=0 os 

Using this, exactly as in (14.1.31) we obtain 


d 

de 


F[f] 

0 


Eu{L) 


dl 

de 


dx, 
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£=0 


(4.2.14) 














where Eu{L) is evaluated at u = f. To continue, we need to determine 

To calculate f{x,e), we need to solve the first equation in (I4.2.1dll for x and then 
insert into the second equation. Thus we have x = x{x,s) with 


X = E(x(x, e), f(x(x, e)) + er]{x{x, e))). 

When computing variations of L, the base variables are not allowed to depend on 
e, so for z = 1 ,... we must have 


de 


o = E^. 


d^'‘ 


P ^ i 1 

, V_ 

de ^ dw 

OL — 1 


Using Cramer’s rule we can solve for (and insert e = 0), to obtain 


dx^ 

de 


1 g^^ 


£ — 0 




det J ^ ^ du° 

i—1 a—1 


where Kij is the (f, j)-cofactor (i.e., signed minor) of the Jacobian J{x) from (I4.2.5|) . 
Consequently, 


de 


=0 “=i t=i 


dx^ 

de 


£—0 


1 






det J ^ L du' 

OL—X 




dE^ 


V ■ 


Here, the term in brackets is the expansion of the determinant (14.2.121) along the 
last column, wherein ■ Kij is the row expansion of the (i,p + l)-st minor 

along its last row (recall that the necessary signs are already contained in the Kij). 
Thus we have shown that 


df^ 

de 


a—l 

If we insert this into (14.2.141) and change variables we get 


det J 

e=0 “=i 


_d 

de 



dx, 


which, on the other hand, must equal 


d 

de 


C[f + er]] 

0 



dx. 


Since r] was arbitrary, this proves that (I4.2.11|) holds at (xo,u^^^). 
Based on this result, we can now show: 


□ 


4.2.7 Theorem. If G is a variational symmetry group of the functional 

C[u\= f L{x,u^^^)dx, 

J Oq 

then G is also a symmetry group of the corresponding Euler-Lagrange equations 
E{L) = 0. 
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Proof. Let g € G and let / be a solution to the Euler-Lagrange equations. Then 
since G is a variational symmetry group, (14.2.911 shows that L{x,pA^^f{x)) and 
L(a:, pr(")/(x)) are related by the change of variables formula (14.2.411 with L = L, 
where g~^ ■ {x,u) =: {E{x,u),^{x,u)). Interchanging the roles of u and ft in (14.2.1111 
we therefore have 

Q 

^ Fai3{x,u^^'^)E^fi{L){x,u^'^'>), a = l,...,q. 

p=i 

Since / is a solution to the Euler-Lagrange equations, {L){x, pr(”)/(x)) = 0 for 
all P and all x. Thus also Euc{L){x,pr^^^ f{x)) = 0, so 5 • / is a solution as well, 
i.e., G is a symmetry group of E{L) =0. □ 


4.3 Conservation laws 

4 . 3.1 Definition. Let P(x, = 0 be a system of differential equations. A 
conservation law is a divergenee expression 

DivP = 0, (4.3.1) 

which vanishes for all solutions u = f{x) of the system. Here, for some m, 

is a p-tuple of smooth functions and Div F is the total divergenee from \4.2.iA F 
then is called a conserved current. 

4.3.2 Example, (i) Let P = A(u), where A is the Laplace-operator. Then A 
itself is a conservation law because 

Au = Div (gradit) = 0 

for every solution. Eurther conservation laws are obtained by multiplying the equa¬ 
tion with Ui = 


0 = u^Au = Dj [u^Uj - -51 Y • 

i=l k=l 

(ii) If P is any system of ODEs, with independent variable x, then any conservation 
law is of the form D^F = 0 for all solutions u = f{x) of the system, i.e., F{x, 
has to be constant along solutions, so F is a first integral of the system (cf. the 
remarks following (13.3.31) 1. In light of this, (14.3.11) can be viewed as a generalization 
of the concept of a first integral to PDEs. 

In applications to physics, often there is a distinguished time-variable t, while the 
remaining variables {x^,... ,xP) are spatial variables. Then any conservation law 
takes the form 

DtT + BivX = 0, 

where Div now is the spatial total divergence (with respect to {x^,... ,x^)). Then 
T is called the (conserved) density and X = {Xi,... ,Xp) is called the flux of the 
conservation law. They are functions of x, t, u and the derivatives of u with respect 
to X and t. 
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Suppose now that C Rp is a spatial domain and let u = f{x,t) be a solution 
defined for all a; G O and a < t < b. Consider the functional 

Tn[/](t) = [ f{x,t))dx, (4.3.2) 

Jn 

which, for /, fixed, depends only on t. Then we have: 


4.3.3 Proposition. T and X are the density and the flux of a conservation law 
of a system of differential equations if and only if, for any bounded domain C Rp 
with smooth boundary dfl and for any solution u = f{x,t) defined for x G and 
t G [a, 6] we have 

TnlfAt) - Tnlffla) = - [ [ X{x,T,pr‘^"^'> f{x,T)) ■ ndSdr, (4.3.3) 

with n the outward-pointing unit normal vector field on dVl. 


Proof. Differentiating (I4.3.2L by the divergence theorem we have 

^Tn[f]{t)=[DtT{x,t,w^""~^^^f)dx = -[ X{x,t,pT^’^~^'>f) ■ ndS, (4.3.4) 

™ JQ Jon 

so (I4.3.3|) follows by integration. 

Conversely, differentiating (14.3.311 with respect to t and applying the divergence 
theorem we obtain 

[ {DtT{x,t,pT^"^''~^^f) + DivX(a:,t,pr(™+^V)) dx = 0. 

Ja 

As this is to hold for any D and /, the claim follows. □ 


4.3.4 Example. Consider the motion of an incompressible, inviscid fluid. With 
a: G R^ representing the spatial coordinates, let u = u{x, t) G R^ denote the velocity 
of a fluid particle at position x and time t. Let p{x, t) be the density and p{x, t) 
be the pressure of the fluid. We assume that the flow is isentropic (has constant 
entropy), then p depends only on p. The equation of continuity then takes the form 


pt + Div (p • m) = 0, (4.3.5) 

where Div {pu) = ^^ 7 ^ is the spatial total divergence. Also, momentum bal¬ 

ance gives the three equations 


du^ 

dt 




1 dp 
p dx'- ’ 


i = 1,2,3. 


(4.3.6) 


Here, the equation of continuity is itself a conservation law with density T = p and 
flux X = pu. Using (14.3.41) for the functional (14.3.21) corresponding to the flux we 
get 

~r I pdx = — j pu ■ ndS. 
dt Ja Joa 

This equation has an immediate physical interpretation: p dx is the total mass 

of the fluid within D, and pu-n is the flux of fluid out of a point on the boundary of 
D. This means that the net change of mass inside D equals the influx of fluid into D 
via dVL. In particular, if the normal component u - n oi the velocity on d^l vanishes 
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then there is no change in the mass in f2 and we obtain the law of conservation of 
mass: 

/ pdx = const. 

Jo. 

Furthermore, combining (14.3.51) and (j4.3.6|) and re-arranging we get three further 
conservation laws: 

3 

Dt{pu^) 4- ^ Dj{pu^u^ + pSj) = 0, i = 1, 2, 3. 
i=i 


Again using (14.3.41) we obtain the laws of conservation of momentum: 


dt 



{pu^{u ■ n) + prii) dS, 


I an 


1,2,3. 


Here, the first term on the right hand side is the transport of momentum pu^ due to 
the flow across the surface dVL, and the second term is the net change in momentum 
due to the pressure across d^. Thus Xj = pu^u^ + p5l represents the components 
of the momentum flux. 


4.4 Noether’s theorem 


In this section we prove a fundamental result on the connection between variational 
symmetries and conservation laws, established by Emmy Noether in 1918. It is 
of central importance in mathematical physics as the source of many fundamental 
laws of nature. 


4.4.1 Theorem. Let G he a local one-parameter group of variational symmetries 
of the variational problem C[u] = / L(a;,u^"^) dx. Let 


= ^r(x,u)^ + ^(/>a(x,u) ^ 


du° 


i—1 

he the infinitesimal generator of G, and as in (13.10.261) let 


(4.4.1) 


p 

Qaix,u^^'>) := (j)a{x,u) - '^C{x,u)uf, a = l,...,q. 

2=1 


be the characteristic ofv. Then there exists a p-tuple 


such that 


DivF = Q • E{L) = ^ Qa • E^{L). 


(4.4.2) 


a=l 


Thus DivE = 0 zs a conservation law for the Euler-Lagrange equations E{L) = 0. 

Proof. Since v is the infinitesimal generator of a variational symmetry, by (14.2.81) 
and (13.10.291) we have 

p 

0 = pr(”)u(L) -I- L Div ^ = pr(”)uQ(L) + ^ f AT -4 T Div^ 

2=1 

= pr(”)uQ(L)+Div(A), 
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where := ,..., L^p). We now ‘integrate by parts’ the first term, using 


pr(’")u, 


dL 


dL 


qW — ^ DjQa ^ Qa ■ ( D)j + Div A 


a,J 

q 


Oi,J 

= ^ QaEa {L) + Div A, 


du° 


where A = {Ai, ..., Ap) is some p-tuple of functions depending on Q, L and their 
derivatives, but whose concrete form is not important for us. It follows that 


pr(")uQ(L) = Q-£;(L)+DivA 


Therefore, 


0 = Q • i;(L) + Div (A + L^), 
which means that (14.4.21) holds with F = — {A + L^). 


(4.4.3) 

(4.4.4) 
□ 


To explicitly calculate F one may in principle follow the calculation of the previous 
proof. In applications, in most cases one encounters first order variational problems. 
For these, an explicit formula is given in the following result: 

4.4.2 Corollary. Let C[u\ = f L(x, dx be a first order variational problem, 
and let v as in (14.4.111 be a variational symmetry. Then 


q p 




q;=1 




j a dL 


(4.4.5) 


form the components of a conservation law Div F = 0 for the corresponding Euler- 
Lagrange equations E{L) = 0. 

Proof. Since L depends only on derivatives up to order one, (13.10.3011 gives 

pr<%(L) = t(<3„A + £A<3„|^). 

ct—l i—1 ^ 

Now setting A := we have 

Div^ = E (^‘O-H + E■ 

ct,i ^ ct^i ^ 

On the other hand, (14.1.411 implies 


dL 


dL 


Q ■ E{L) — '^Qa 


du° 


so altogether we obtain 


pr<^^)uQ {L) = Q ■ E{L) + Div A, 


verifying (14.4.311 . 


□ 
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4.4.3 Example. Perhaps the most prominent application of the Noether theorem 
is to the mechanics of particles. Consider a system of n particles moving in 
in some force field given by a potential. The system is described by the position 
of its particles, where x“ = (a;“, ?/“, 2 :“) is the position of the a-th particle, and 
X = (x^,... ,x"’) is a vector containing the information about all positions of the 
particles. Assuming that the a-th particle has mass ma, the total kinetic energy of 
the system is 

n 

K{±) = - ^m„|x“p. 

a—l 

Also, we assume that the force field is determined by a potential U = U(x,t). 
Newton’s equation of motion then gives 


nia^ — qJJ — iUj UyOL ^ j a — l,...,7z. 


The important point to note here is that these equations are the Euler-Lagrange 
equations of the Lagrange function (or action integral) 



{K - U) dt. 


(4.4.6) 


Indeed, 


, . dL dL 


and analogously for y°‘ and 2 “ 
By (14.2.811 . a vector field 


dx.° 


dt 5a;“ ^ ^ dz°‘J 


generates a variational symmetry of the Lagrangian (14.4.61) if and only if 

v{K-U) + {K-U)DtT = 0 V(t,x). (4.4.7) 

Given a variational symmetry, by Noether’s theorem l4.4.2l we obtain a corresponding 
conservation law (or first integral) with single component (since p = 1) 


F = ^ • x“ -|- t{K — U) — ^ rx“ • {maX°‘) 


a=l 


a=l 


= • x“ - r(iL + C7) = ^ - tE, 


(4.4.8) 


where E = K + U is the total energy. Since DtF = 0 it follows that F has to 
be constant for any solution of Newton’s equations of motion. We now analyze 
some examples of variational symmetries of (14.4.611 that lead to conservation laws 
of physical interest. 


• V = dt- In this case, pr^^^z; = u, so (14.4.711 holds if and only dtU = 0, i.e., 
if and only if U does not explicitly depend on the time t. Since r = 1 and 
1 = 0, the conserved quantity given in (14. 4. 811 then is the total energy E. We 
obtain that invariance of a physical system under time translations implies 
conservation of energy. 
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• V = X]a=i ^ This is the generator of the translation x i— >■ x + a, so 

all particles are simultaneously translated in the same fixed direction a G 
Also in this case, pr^^^u = v, so (14.4.71) holds if and only v{U) = 0, i.e., if and 
only if the potential is translationally invariant in the direction a. As r = 0 
and = a for all a, the corresponding conserved quantity from (14.4.81) is the 
momentum 

n 

maSL ■ x“ = const. 

CK — 1 

In particular, if U is invariant under all translations then the total momentum 
Sa=i is conserved. 

• V = ~ This is the generator of a simultaneous rotation 

of all the masses in the system about some fixed axis, in this case the z-axis. 
By (I3.10.19p we have 


pr^^^u = V 


n 


dr 


—)■ 

dx°‘) 


Note that pr(^)u(A') = 0 (since K is invariant under rotations of velocities), 
and since r = 0, (14.4.71) shows that v generates a variational symmetry if and 
only if v(U) = 0, i.e., if and only if U is invariant under rotations around the 
z-axis. In this case, the conserved quantity corresponding to this symmetry 
via (14.4.81) is the angular momentum around the z-axis 


n 

ma{x°‘y°‘ — = const. 

a=l 


Thus rotational invariance implies conservation of angular momentum. 


In particular, if we assume that the particles only interact through their mutual 
gravitational (or electrostatic, ...) attraction, then the potential energy is of the 
form 

U[t,yL) = y]]7„,g|x“-x^|-^ 

«//3 

In this case, all of the above assumptions are satisfied, so we obtain conservation of 
energy, momentum, and angular momentum. 

To conclude this chapter we introduce a straightforward generalization of Noether’s 
theorem that also leads to conservation laws of physical interest. 

4.4.4 Definition. Let C[u] = f L(x, da; be a variational problem. A vector 
field V on M Q X X U is called an infinitesimal divergence symmetry of C if there 
exists a p-tuple = {Bi,..., Bp) of functions of x, u and the derivatives 

of u such that 

pr(")u(L) + L-Div^ = DivS (4.4.9) 

for all {x, u) € M. 

The point to note is that the Noether theorem |4.4.1l remains valid if in the hypothesis 
we replace ‘variational symmetry’ by ‘divergence symmetry’. Indeed, the only thing 
that has to be changed in the proof is that we have to incorporate the new term 
Div B into (14.4.41) , which now becomes 

Q ■ E{L) + Div (A + Lf) = Div B, 
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but we still obtain a conservation law of the form (14.4.2L namely F = B — A—L^. In 
particular, in the case of a variational problem of first order, the conserved quantity 
corresponding to (14.4.511 now becomes 




dL 
' duf 


q p 

cl-EEp 

«=ii=i 


dL 

duf 


-B, 


= 1 , 


..,p 


(4.4.10) 


4.4.5 Example. Returning to the setup of 14.4.31 consider a Galilean boost 

(t, x“) !->• (t, x“ + eta), 

for some a G Bv (l3.10.19l) . the infinitesimal generator v = X]q=i tady^a then has 
prolongation 

prm„ = y:(fa^ + a^), 

OL — 1 

SO 

n n 

pr^^^u(L) = pr*-^^ (^K — U) = ^ mQ,a ■ x°‘ — t ^ a • VqI/. 

Q;=l Q!=l 

Since r = 0, (14.4.71) demands that this expression should vanish identically, but this 
is never the case for a yt 0. However, since 

n n 

mQ,a ■ -k!^ = Dt(^ m^a • x“^ , 

CX.— \ OL — 1 

V is an infinitesimal divergence symmetry if a - VoJJ = 0, i.e., if U is translationally 
invariant in the direction a. The conserved quantity according to (14.4.lOp now reads 

n n 

t mQ,a • x“ — nriaa ■ x“. 

Q!=l a=l 

Here the first sum, when divided by the total mass is the position of the 

center of mass of the system in the direction a, and the second is the momentum in 
that direction. It follows that if U is translationally invariant in a given direction 
then on the one hand, as shown in l4.4.31 the momentum in that direction is constant. 
In addition it now follows that the center of mass in that direction is a linear function 
of t'. 


position of center of mass = initial position + t(momentum)/mass. 

In particular, if U is invariant under all translations, then the center of mass of any 
such system moves linearly in one fixed direction. 


4.5 Trivial conservation laws and characteristics 

There are certain types of conservation laws that do not yield any useful information 
on the system under consideration and hence are called trivial. A conservation law 
F = (Fi,... ,Fp) can be trivial for one of two reasons: triviality of the first kind 
holds if F itself vanishes on every solution of the system. This kind of triviality 
is usually easy to eliminate by solving the system itself and its prolongations for 
certain of the variables Uj and then substituting for these variables wherever they 
occur. For example, in the case of an evolution equation ut = P(a;,u^"^), any time 
derivative of u, e.g., Uu, Uxt, etc. can be expressed in terms of x, u and spatial 
derivatives of u. 
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4.5.1 Example. Consider the system of first order evolution equations 


Ut=Vx, Vt=Ux 


which is equivalent to the one-dimensional wave equation utt = u^x- Then 



is a conservation law for this system. As explained above, we can replace the density 
and the flux of this conservation law by ones that depend only on spatial derivatives. 
This gives 



The two conservation laws differ by the trivial conservation law 



for which both density and flux vanish on any solution of the system. In the same 
way, for any conservation law of an evolution equation there is, up to addition of a 
trivial conservation law of the first kind, one where density and flux depend only 
on spatial derivatives. 

A conservation law is called trivial of the second kind if the total divergence Div F 
in fact vanishes on all smooth functions /, whether or not they solve the equation. 
An example of this kind of triviality is the relation 


(4.5.1) 


D xUy F)yUx — 0 ; 


which obviously holds for any smooth function u = f{x,y). Hence (14.5.11) is a 
trivial conservation law of the second kind for any partial differential equation for 
functions of x and y. The underlying p-tuples (Fi,..., Fp) of conservation laws of 
the second kind are also called null divergences. 

There is in fact a complete characterization of null divergences that is a direct 
analogue of the characterization of the kernel of the usual divergence operator via the 
Poincare lemma. Since the proof of this result, building on the so-called variational 
complex, is quite involved, we only state the result and refer to [Til Sec. 5.4] for a 


proof. 


4.5.2 Theorem. Let F = (Fi,..., Fp) be a p-tuple of smooth functions depending 


onx = {x^,... ,xP), (rt^,..., and derivatives ofu, and defined on all of Xx . 
Then the following are equivalent: 


(i) DivF = 0. 

(ii) There exist smooth functions Gjk, j,k = l,...,p, depending on x, u, and 
derivatives of u, such that Gjk = —Gkj for all j, k, and 


p 



(4.5.2) 


For example, if p = 3, then 


Div F = DiFi D 2 F 2 D^F^ = 0 
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if and only if F is a ‘total curl’: F = Curl(G'), i.e., 

Fi = D2G3 — D^G2-, F2 = D3G1 — -D1G3, F^ = D1G2 — D2G1 

(where we have identified G 12 = —G 21 from the theorem with G 3 , etc.). 

We define a trivial conservation law to be any conservation law that is a linear 
combination of trivial conservation laws of the first and second kind. Thus Div F = 
0 is a trivial conservation law if and only if there exist functions Gy as in 14.5.21 (ii) 
such that (|4.5.2I1 holds for all solutions of the system. Two conservation laws are 
called equivalent if they differ only by a trivial conservation law. The interesting 
objects in the study of conservation laws therefore are equivalence classes in this 
sense. 

For the following considerations on characteristics of conservation laws we need 
some preparations. 

4.5.3 Definition. Let Pv{x,u^'^^) = 0, v = be a system of differential 

equations, with P : The k-th prolongation of this system is the {n + k)- 

th order system of differential equations 

pW(a:, = 0 

obtained by differentiating in all possible ways k times. Thus the new system consists 
■ I equations 

= 0 , 

where v = 1 ,... ,l and 0 < IJ < fc. 

4.5.4 Example. If P is the heat equation ut = Uxx, then the first prolongation 
P^^'i is the system 

— '^XXl ^xt — '^XXXf — ^xxt'i 

and the second one contains, in addition, the equations 


of the 


p + k — 1 
k 


'^XXt — ^XXXX'! '^xtt — '^xxxtl — '^xxtt- 


We then clearly have: 

4.5.5 Lemma. If u = f{x) is a solution of the system P(a;,u^”^) = 0, then it is 
also a solution of every prolongation (a;, = 0, A: = 0,1, 2,.... 


4.5.6 Definition. A system of differential equations is caHed totally nondegenerate 
if it and all its prolongations are nondegenerate (i.e., of maximal rank and locally 
solvable). 

4.5.7 Lemma. Let Pi,{x, = 0, u = 1,... ,1, be a totally nondegenerate system 
of differential equations. Let Q = Q{x,u^'^'^) be a smooth function. Then the 
following are equivalent: 


(i) Q vanishes for all solutions u = f(x) of the system. 

(ii) There exist differential operators Vi, = Qi{x, u^'^'>)Dj, v = 1 ,... ,1, such 
that 


i 


q = Y^v,p, 


for all [x, 
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Proof. (ii)=>(i) is obvious. 

(i)^(ii): Clearly we can assume that m > n. By 14.5.51 (i) is equivalent to Q 
vanishing on all solutions of the prolonged system to which is locally solv¬ 

able by assumption. Thus it follows that Q vanishes on the zero set of 
Since, moreover, is nondegenerate, bv Id. 2. 15] there exist smooth functions 

Qi (tt>^ < m — n, 1 < < 1) such that 

EE 


Let now Div F = 0 be a conservation law of a totally nondegenerate system 

=0 

of differential equations. Then by 14.5.71 Div F vanishes on every solution of the 
system if and only if there exist smooth functions Qi{x, such that 

Biy F = Y,QiDjP,. (4.5.3) 

We now note that each term in (I4.5.3P can be ‘integrated by parts’: for example, if 
1 < j < p then 

QiDjP, = D,{QiP,) - D,{Qi)P,. 

Proceeding in this way, we can cast (14.5.31) in the form 

i 

DivF = DivG ^ Q^P^ = BiyGfQ-P, 


where Q = (Qi, ■ • ■, Q/) has entries 

Q, = Y,{-D).jQi, (4.5.4) 

j 

and all we need to know about G = (Gi ,... ,Gp) is that it depends linearly on the 
components of P and their total derivatives. This means that G defines a trivial 
conservation law of the first kind. Consequently, replacing F hy F — G we obtain 
an equivalent conservation law of the form 


DivF = Q-P. (4.5.5) 

This is called the characteristic form of the conservation law (I4.5.3L and Q = 
(Qi, ... ,Qi) is called the characteristic of the conservation law. 

Unless / = 1, the Qi, from ()4.5.5I) are in general not uniquely determined. If Q and 
Q are two Z-tuples both satisfying (14.5.51) for the same P, then {Q — Q) ■ P = 0. 
Since P is nondegenerate, bv l3.2.17l it follows from this that Q — Q vanishes on all 
solutions. Based on this observation, we call a characteristic trivial if it vanishes on 
all solutions of the system, and we call two characteristics equivalent if they differ 
by a trivial one. In this terminology, characteristics are in general only determined 
up to equivalence. 

4.5.8 Example. To obtain the characteristic form of the conservation law for the 
heat equation given in 14. 5. 11 we re-write it in the form (14.5.31) : 

A(^Mt + ^ul) - D^{utUaf) = UtDtiut - Vx) + UtDx{vt - Ux). 
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Then (|4.5.4I) shows that the characteristic is given by 




and an equivalent conservation law in characteristic form can be found through 
integration by parts: 

+ Dx{-UtUx) = -Utt{ut - Vx) - Uxt{vt - Ux). 


One can show that for systems of differential equations that are totally nondegen¬ 
erate and normal (i.e., possessing a noncharacteristic direction at every point) the 
two notions of equivalence we have introduced above actually coincide: 

4.5.9 Theorem. Let = 0 be a normal and totally nondegenerate system 

of differential equations. Let the p-tuples F and F determine conservation laws with 
characteristics Q and Q, respectively. Then F and F are equivalent as conservation 
laws if and only if Q and Q are equivalent as characteristics. 

For a proof we refer to [Ml Sec. 4.3]. 

Finally, we may apply the terminology developed in this chapter to re-formulate 
Noether’s theorem 14.4.11 more precisely: 


4.5.10 Theorem. Let G be a local one-parameter group of variational symmetries 
of the variational problem £\u] = / L{x,u^'^'>) dx. Let 


V = 


2 = 1 


d 

dx^ 





be the infinitesimal generator of G, and as in (|3.1Q.26p let 


p 

Qaix,u‘'^'>) := (l)a{x,u) - '^Cix,u)ui, a = l,...,q. 

i=l 


be the characteristic of v. Then Q = {Qi,... ,Qq) is also the characteristic of a 
conservation law for the corresponding Euler-Lagrange equations E{L) = 0, i.e., 
there exists a p-tuple 

F{x, = (Fi(x, ..., Fp(x, 


such that 

q 

DivF = Q • E{L) = ^ Qa • E^{L) 

a=l 

is a conservation law in characteristic form for the Euler-Lagrange equations 

E{L) = 0. 
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